All Projects → rnabioco → valr

rnabioco / valr

Licence: Unknown, MIT licenses found Licenses found Unknown LICENSE MIT LICENSE.md
Genome Interval Arithmetic in R

Programming Languages

r
7636 projects
C++
36643 projects - #6 most used programming language
c
50402 projects - #5 most used programming language

Projects that are alternatives of or similar to valr

16gt
Simultaneous detection of SNPs and Indels using a 16-genotype probabilistic model
Stars: ✭ 26 (-66.67%)
Mutual labels:  genome
Viral Ngs
Viral genomics analysis pipelines
Stars: ✭ 150 (+92.31%)
Mutual labels:  genome
Abyss
🔬 Assemble large genomes using short reads
Stars: ✭ 219 (+180.77%)
Mutual labels:  genome
Gatk
Official code repository for GATK versions 4 and up
Stars: ✭ 1,002 (+1184.62%)
Mutual labels:  genome
Biomartr
Genomic Data Retrieval with R
Stars: ✭ 144 (+84.62%)
Mutual labels:  genome
Genometools
GenomeTools genome analysis system.
Stars: ✭ 186 (+138.46%)
Mutual labels:  genome
Mosdepth
fast BAM/CRAM depth calculation for WGS, exome, or targeted sequencing
Stars: ✭ 376 (+382.05%)
Mutual labels:  genome
LTRpred
De novo annotation of young retrotransposons
Stars: ✭ 35 (-55.13%)
Mutual labels:  genome
Soapdenovo2
Next generation sequencing reads de novo assembler.
Stars: ✭ 150 (+92.31%)
Mutual labels:  genome
Genomepy
Download and use genomes the easy way.
Stars: ✭ 209 (+167.95%)
Mutual labels:  genome
Arcs
🌈Scaffold genome sequence assemblies using linked read sequencing data
Stars: ✭ 67 (-14.1%)
Mutual labels:  genome
Augustus
Genome annotation with AUGUSTUS
Stars: ✭ 129 (+65.38%)
Mutual labels:  genome
Karyoploter
karyoploteR - An R/Bioconductor package to plot arbitrary data along the genome
Stars: ✭ 192 (+146.15%)
Mutual labels:  genome
Ai Programmer
Using artificial intelligence and genetic algorithms to automatically write programs. Tutorial: http://www.primaryobjects.com/cms/article149
Stars: ✭ 948 (+1115.38%)
Mutual labels:  genome
genome updater
Bash script to download/update snapshots of files from NCBI genomes repository (refseq/genbank) with track of changes and without redundancy
Stars: ✭ 93 (+19.23%)
Mutual labels:  genome
Metacache
memory efficient, fast & precise taxnomomic classification system for metagenomic read mapping
Stars: ✭ 26 (-66.67%)
Mutual labels:  genome
Ribbon
A genome browser that shows long reads and complex variants better
Stars: ✭ 184 (+135.9%)
Mutual labels:  genome
DNA-Sequence-Machine-learning
Understand DNA structure and how machine learning can be used to work with DNA sequence data.
Stars: ✭ 25 (-67.95%)
Mutual labels:  genome
MGSE
Mapping-based Genome Size Estimation (MGSE) performs an estimation of a genome size based on a read mapping to an existing genome sequence assembly.
Stars: ✭ 22 (-71.79%)
Mutual labels:  genome
Deepvariant
DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.
Stars: ✭ 2,404 (+2982.05%)
Mutual labels:  genome

valr

R-CMD-check codecov

valr provides tools to read and manipulate genome intervals and signals, similar to the BEDtools suite.

Installation

The latest stable version can be installed from CRAN:

install.packages('valr')

The latest development version can be installed from github:

# install.packages("devtools")
devtools::install_github('rnabioco/valr')

valr Example

Functions in valr have similar names to their BEDtools counterparts, and so will be familiar to users coming from the BEDtools suite. Unlike other tools that wrap BEDtools and write temporary files to disk, valr tools run natively in memory. Similar to pybedtools, valr has a terse syntax:

library(valr)
library(dplyr)

snps <- read_bed(valr_example('hg19.snps147.chr22.bed.gz'), n_fields = 6)
genes <- read_bed(valr_example('genes.hg19.chr22.bed.gz'), n_fields = 6)

# find snps in intergenic regions
intergenic <- bed_subtract(snps, genes)
# find distance from intergenic snps to nearest gene
nearby <- bed_closest(intergenic, genes)

nearby %>%
  select(starts_with('name'), .overlap, .dist) %>%
  filter(abs(.dist) < 5000)
#> # A tibble: 1,045 × 4
#>    name.x      name.y   .overlap .dist
#>    <chr>       <chr>       <int> <int>
#>  1 rs530458610 P704P           0  2579
#>  2 rs2261631   P704P           0  -268
#>  3 rs570770556 POTEH           0  -913
#>  4 rs538163832 POTEH           0  -953
#>  5 rs190224195 POTEH           0 -1399
#>  6 rs2379966   DQ571479        0  4750
#>  7 rs142687051 DQ571479        0  3558
#>  8 rs528403095 DQ571479        0  3309
#>  9 rs555126291 DQ571479        0  2745
#> 10 rs5747567   DQ571479        0 -1778
#> # … with 1,035 more rows
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].