bodkan / admixr

Licence: other

An R package for reproducible and automated ADMIXTOOLS analyses

Programming Languages

7636 projects

Makefile

30231 projects

Projects that are alternatives of or similar to admixr

CANDO

Computational Analysis of Novel Drug Opportunities

Stars: ✭ 27 (+35%)

Mutual labels: bioinformatics

BridgeDb

The BridgeDb Library source code

Stars: ✭ 22 (+10%)

Mutual labels: bioinformatics

sirius

SIRIUS is a software for discovering a landscape of de-novo identification of metabolites using tandem mass spectrometry. This repository contains the code of the SIRIUS Software (GUI and CLI)

Stars: ✭ 32 (+60%)

Mutual labels: bioinformatics

wgs2ncbi

Toolkit for preparing genomes for submission to NCBI

Stars: ✭ 25 (+25%)

Mutual labels: bioinformatics

gnparser

GNparser normalises scientific names and extracts their semantic elements.

Stars: ✭ 26 (+30%)

Mutual labels: bioinformatics

dysgu

dysgu-SV is a collection of tools for calling structural variants using short or long reads

Stars: ✭ 47 (+135%)

Mutual labels: bioinformatics

ensembl-compara

The Ensembl Compara Perl API and SQL schema

Stars: ✭ 43 (+115%)

Mutual labels: bioinformatics

paccmann datasets

pytoda - PaccMann PyTorch Dataset Classes. Read the docs: https://paccmann.github.io/paccmann_datasets/

Stars: ✭ 15 (-25%)

Mutual labels: bioinformatics

sample-sheet

A permissively licensed library designed to replace Illumina's Experiment Manager

Stars: ✭ 42 (+110%)

Mutual labels: bioinformatics

CeleScope

Single Cell Analysis Pipelines

Stars: ✭ 36 (+80%)

Mutual labels: bioinformatics

PrimerMiner

R mased batch sequence downloader, with primer development and in silico evaluation capabilities

Stars: ✭ 27 (+35%)

Mutual labels: bioinformatics

TeamTeri

Genomics using open source tools, running on GCP or AWS

Stars: ✭ 30 (+50%)

Mutual labels: bioinformatics

awesome-phages

A curated list of phage related software and computational resources for phage scientists, bioinformaticians and enthusiasts.

Stars: ✭ 14 (-30%)

Mutual labels: bioinformatics

PHAT

Pathogen-Host Analysis Tool - A modern Next-Generation Sequencing (NGS) analysis platform

Stars: ✭ 17 (-15%)

Mutual labels: bioinformatics

diploSHIC

feature-based deep learning for the identification of selective sweeps

Stars: ✭ 36 (+80%)

Mutual labels: population-genetics

Scaff10X

Pipeline for scaffolding and breaking a genome assembly using 10x genomics linked-reads

Stars: ✭ 21 (+5%)

Mutual labels: bioinformatics

protwis

Protwis is the backbone of the GPCRdb. The GPCRdb contains reference data, interactive visualisation and experiment design tools for G protein-coupled receptors (GPCRs).

Stars: ✭ 20 (+0%)

Mutual labels: bioinformatics

codon-usage-tables

📊 Codon usage tables in code-friendly format + Python bindings

Stars: ✭ 21 (+5%)

Mutual labels: bioinformatics

rkmh

Classify sequencing reads using MinHash.

Stars: ✭ 42 (+110%)

Mutual labels: bioinformatics

GRAFIMO

GRAph-based Finding of Individual Motif Occurrences

Stars: ✭ 22 (+10%)

Mutual labels: bioinformatics

View All Similar Projects ➔

admixr

Introduction

ADMIXTOOLS is a widely used software package for calculating admixture statistics and testing population admixture hypotheses.

A typical ADMIXTOOLS workflow often involves a combination of sed/awk/shell scripting and manual editing to create different configuration files. These are then passed as command-line arguments to one of ADMIXTOOLS commands, and control how to run a particular analysis. The results are then redirected to another file, which has to be parsed by the user to extract values of interest, often using command-line utilities again or (worse) by manual copy-pasting. Finally, the processed results are analysed in R, Excel or another program.

This workflow can be a little cumbersome, especially if one wants to explore many hypotheses involving different combinations of populations. Most importantly, however, it makes it difficult to follow the rules of best practice for reproducible science, as it is nearly impossible to construct reproducible automated "pipelines".

This R package makes it possible to perform all stages of an ADMIXTOOLS analysis entirely from R. It provides a set of convenient functions that completely remove the need for "low level" configuration of individual ADMIXTOOLS programs, allowing users to focus on the analysis itself.

How to cite

admixr is now published as an Application Note in the journal Bioinformatics. If you use it in your work, please cite the paper!

Installation instructions

Latest stable version

The package is available on CRAN. You can install it simply by running

install.packages("admixr")

from your R session. This the recommended procedure for most users.

Development version

To install the development version from Github (which might be slightly ahead in terms of new features and bugfixes compared to the stable release on CRAN), you need the package devtools. You can run:

install.packages("devtools")
devtools::install_github("bodkan/admixr")

Installing ADMIXTOOLS

In order to use the admixr package, you need a working installation of ADMIXTOOLS. You can find installation instructions here.

Furthermore, you also need to make sure that R can find ADMIXTOOLS binaries on the $PATH. You can achieve this by specifying PATH=<path to the location of ADMIXTOOLS programs> in the .Renviron file.

Example

This is all the code that you need to perform ADMIXTOOLS analyses using this package! No shell scripting, no copy-pasting and manual editing of text files. The only thing you need is a working ADMIXTOOLS installation and a path to EIGENSTRAT data (a trio of ind/snp/geno files), which we call prefix here.

library(admixr)

# download a small testing dataset to a temporary directory and
# process it for use in R
snp_data <- eigenstrat(download_data())

result <- d(
  W = c("French", "Sardinian"), X = "Yoruba", Y = "Vindija", Z = "Chimp",
  data = snp_data
)

result
#> # A tibble: 2 × 10
#>   W         X      Y       Z          D  stderr Zscore  BABA  ABBA  nsnps
#>   <chr>     <chr>  <chr>   <chr>  <dbl>   <dbl>  <dbl> <dbl> <dbl>  <dbl>
#> 1 French    Yoruba Vindija Chimp 0.0313 0.00693   4.51 15802 14844 487753
#> 2 Sardinian Yoruba Vindija Chimp 0.0287 0.00679   4.22 15729 14852 487646

Note that a single call to the d function generates all required intermediate config and population files, runs ADMIXTOOLS, parses its log output and returns the result as a data.frame object. It does all of this behind the scenes, without the user having to deal with low-level technical details.

More information

To see many more examples of admixr in action, please check out the tutorial vignette.

If you want to stay updated on new admixr development, follow me on Twitter.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

bodkan / admixr

Programming Languages

Labels

Projects that are alternatives of or similar to admixr

admixr

Introduction

How to cite

Installation instructions

Latest stable version

Development version

Installing ADMIXTOOLS

Example

More information