All Projects → knausb → Vcfr

knausb / Vcfr

Tools to work with variant call format files

Programming Languages

r
7636 projects

Projects that are alternatives of or similar to Vcfr

Biomartr
Genomic Data Retrieval with R
Stars: ✭ 144 (-3.36%)
Mutual labels:  genomics, rstats
Mixomics
Development repository for the Bioconductor package 'mixOmics '
Stars: ✭ 58 (-61.07%)
Mutual labels:  genomics, rstats
wdlRunR
Elastic, reproducible, and reusable genomic data science tools from R backed by cloud resources
Stars: ✭ 34 (-77.18%)
Mutual labels:  genomics, rstats
Streamgraph
〰️ htmlwidget for creating streamgraph visualizations in R
Stars: ✭ 137 (-8.05%)
Mutual labels:  rstats
Rnaturalearth
an R package to hold and facilitate interaction with natural earth map data 🌍
Stars: ✭ 140 (-6.04%)
Mutual labels:  rstats
Complex Upset
A library for creating complex UpSet plots with ggplot2 geoms
Stars: ✭ 147 (-1.34%)
Mutual labels:  rstats
G2r
📈 Grammar of graphics for interactive visualisations
Stars: ✭ 150 (+0.67%)
Mutual labels:  rstats
Dataspice
🌶 Create lightweight schema.org descriptions of your datasets
Stars: ✭ 137 (-8.05%)
Mutual labels:  rstats
Shinyalert
🗯️ Easily create pretty popup messages (modals) in Shiny
Stars: ✭ 148 (-0.67%)
Mutual labels:  rstats
Rcade
Games to procrastinate with RStudio
Stars: ✭ 146 (-2.01%)
Mutual labels:  rstats
Textfeatures
👷‍♂️ A simple package for extracting useful features from character objects 👷‍♀️
Stars: ✭ 148 (-0.67%)
Mutual labels:  rstats
Googlelanguager
R client for the Google Translation API, Google Cloud Natural Language API and Google Cloud Speech API
Stars: ✭ 145 (-2.68%)
Mutual labels:  rstats
Rtutor
Creating interactive R Problem Sets. Automatic hints and solution checks. (Shiny or RStudio)
Stars: ✭ 141 (-5.37%)
Mutual labels:  rstats
Smoove
structural variant calling and genotyping with existing tools, but, smoothly.
Stars: ✭ 147 (-1.34%)
Mutual labels:  genomics
Docxtractr
✂️ Extract Tables from Microsoft Word Documents with R
Stars: ✭ 139 (-6.71%)
Mutual labels:  rstats
Pins
Pin, Discover and Share Resources
Stars: ✭ 149 (+0%)
Mutual labels:  rstats
Hgvs
Python library to parse, format, validate, normalize, and map sequence variants. `pip install hgvs`
Stars: ✭ 138 (-7.38%)
Mutual labels:  genomics
Colourpicker
🎨 A colour picker tool for Shiny and for selecting colours in plots (in R)
Stars: ✭ 144 (-3.36%)
Mutual labels:  rstats
Fasterize
High performance raster conversion for modern spatial data 🚀🌏▦
Stars: ✭ 146 (-2.01%)
Mutual labels:  rstats
Gender
Predict Gender from Names Using Historical Data
Stars: ✭ 149 (+0%)
Mutual labels:  rstats

VcfR: a package to manipulate and visualize VCF data in R

On CRAN: CRAN_Status_Badge

Depsy: Research software impact

Travis-CI (Ubuntu Linux): Travis-CI Build Status

Appveyor (Windows): AppVeyor Build Status

Coveralls: Coverage Status


Supercontig_50

VcfR is an R package intended to allow easy manipulation and visualization of variant call format (VCF) data. Functions are provided to rapidly read from and write to VCF files. Once VCF data is read into R a parser function extracts matrices from the VCF data for use with typical R functions. This information can then be used for quality control or other purposes. Additional functions provide visualization of genomic data. Once processing is complete data may be written to a VCF file or converted into other popular R objects (e.g., genlight, DNAbin). VcfR provides a link between VCF data and the R environment connecting familiar software with genomic data.

VcfR is built upon two data structures.

vcfR - S4 class to contain data from a VCF file.

chromR - S4 class to contain variant information (VCF) and optional sequence (FASTA) and annotation (GFF) information.

Functions in vcfR provide the ability to subset VCF data as well as to extract and parse the data. For example, individual genotypes, sequence depths or genotype likelihoods (when provided in the VCF file) can easily be accessed. These tools are provided to aid researchers in rapidly surveying the quality and other characteristics of data provided as VCF data. With this information in hand, researchers should be able to determine criteria for hard filtering in order to attempt to maximize biological variation and minimize technical variation.

Documentation

Documentation for vcfR can now be found here: vcfR_documentation.

We also have Population genetics and genomics in R which is more general and provides examples of analyses.

If you think you've found a bug, please see reporting an issue.

Publication

vcfR articles

Knaus, Brian J., and Niklaus J. Grunwald. 2017. VCFR: a package to manipulate and visualize variant call format data in R. Molecular Ecology Resources 17(1):44-53. http://dx.doi.org/10.1111/1755-0998.12549.

Knaus, Brian J., and Niklaus J. Grunwald. 2016. VcfR: an R package to manipulate and visualize VCF format data. bioRxiv: 041277. http://dx.doi.org/10.1101/041277.

Copy number variation article

Knaus, Brian, and Niklaus J. Grünwald. 2018. Inferring variation in copy number using high throughput sequencing data in R. Frontiers in Genetics 9: 123. http://dx.doi.org/10.3389/fgene.2018.00123.

Download

vcfR is available at CRAN. To install use:

install.packages('vcfR')

The development version can be installed through github:

devtools::install_github(repo="knausb/vcfR")
library(vcfR)

If you would like the vignettes use:

devtools::install_github(repo="knausb/vcfR", build_vignettes=TRUE)

If you've built the vignettes, you can browse them with:

browseVignettes(package="vcfR")

If you've installed this package with devtools you will probably need to run:

devtools::install(build_vignettes = TRUE)

Devel branch

The devel branch (which may not be stable) can also be installed:

devtools::install_github(repo="knausb/[email protected]")
library(vcfR)

And to build the vignettes:

devtools::install_github(repo="knausb/[email protected]", build_vignettes=TRUE)

Software that produce VCF files

A fun part of this project has been learning about how people use vcfR. One facet of this is learning about the software that create VCF files. So I've decided to make a list of these software. If you know of a software that I have not included on this list, particularly if you can report that vcfR works with its files, feel free to let me know!

Genomic variant callers:

Restriction site associated DNA markers (e.g., RADseq, GBS):

Manipulation of VCF data:

  • Beagle v4.1 Inputs VCF genotypes and outputs phased genotypes to VCF format
  • pegas::read.vcf Population and Evolutionary Genetics Analysis System
  • PyVCF
  • SnpEff Genetic variant annotation and effect prediction toolbox
  • Picard A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF
  • VCF-kit VCF-kit is a command-line based collection of utilities for performing analysis on Variant Call Format (VCF) files.
  • VCFtools General manipulation and analysis
  • VariantAnnotation::readVcf Bioconductor package for annotating variants

R packages that read VCF data


Enjoy!

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].