All Projects → kkdey → Logolas

kkdey / Logolas

Licence: GPL-3.0 license
R package for Enrichment Depletion Logos (EDLogos) and String Logos

Programming Languages

r
7636 projects

Projects that are alternatives of or similar to Logolas

Xible
Visualize your workflow
Stars: ✭ 49 (+96%)
Mutual labels:  flexible, string
vbml
Way to check, match and resist.
Stars: ✭ 27 (+8%)
Mutual labels:  string
nlcst-to-string
utility to transform an nlcst tree to a string
Stars: ✭ 16 (-36%)
Mutual labels:  string
string
Multibyte strings as objects
Stars: ✭ 58 (+132%)
Mutual labels:  string
justified
Wrap, align and justify the words in a string.
Stars: ✭ 30 (+20%)
Mutual labels:  string
NaturalSort.Extension
🔀 Extension method for StringComparison that adds support for natural sorting (e.g. "abc1", "abc2", "abc10" instead of "abc1", "abc10", "abc2").
Stars: ✭ 94 (+276%)
Mutual labels:  string
NString
A collection of utilities for working with strings in .NET.
Stars: ✭ 34 (+36%)
Mutual labels:  string
TairString
A redis module, similar to redis string, but you can set expire and version for the value. It also provides many very useful commands, such as cas/cad, etc.
Stars: ✭ 99 (+296%)
Mutual labels:  string
esl
Lightweight and flexible UI component library based on web components technology for creating basic UX modules
Stars: ✭ 53 (+112%)
Mutual labels:  flexible
right-align
Right-align the text in a string.
Stars: ✭ 19 (-24%)
Mutual labels:  string
director
Director is a production-ready supervisor and manager for Erlang/Elixir processes that focuses on speed, performance and flexibility.
Stars: ✭ 62 (+148%)
Mutual labels:  flexible
fuseml
FuseML aims to provide an MLOps framework as the medium dynamically integrating together the AI/ML tools of your choice. It's an extensible tool built through collaboration, where Data Engineers and DevOps Engineers can come together and contribute with reusable integration code.
Stars: ✭ 73 (+192%)
Mutual labels:  flexible
problem-solving
No description or website provided.
Stars: ✭ 56 (+124%)
Mutual labels:  string
pascalcase
Convert a string to pascal case (upper camel case). Used by more than 8.7 million projects on GitHub! Please follow this library's author: https://github.com/jonschlinkert
Stars: ✭ 35 (+40%)
Mutual labels:  string
php-helpers
An extensive set of PHP helper functions and classes.
Stars: ✭ 27 (+8%)
Mutual labels:  string
memchr
Optimized string search routines for Rust.
Stars: ✭ 474 (+1796%)
Mutual labels:  string
DS ALGO
Data Structures and algorithms
Stars: ✭ 20 (-20%)
Mutual labels:  string
cpp-code-snippets
Some useful C++ code snippets
Stars: ✭ 35 (+40%)
Mutual labels:  string
html-comment-regex
Regular expression for matching HTML comments
Stars: ✭ 15 (-40%)
Mutual labels:  string
parse-author
Parse a person, author, contributor or maintainer string into an object with name, email and url properties following NPM conventions. Useful for the `authors` property in package.json or for parsing an AUTHORS file into an array of person objects.
Stars: ✭ 23 (-8%)
Mutual labels:  string

Logolas

Logolas is an R package for Enrichment Depletion Logo plots with string symbols, that highlights both enrichment and depletion of symbols, as opposed to standard logo plots, as in seqLogo package, that are biased towards highlighting enrichments. Logolas also generalizes logo plots to use both characters and strings.

If you find a bug, please create an issue.

This code has been tested in ...

misc

License

Copyright (c) 2018-2019, Kushal Dey.

All source code and software in this repository are made available under the terms of the GNU General Public License. See the LICENSE file for the full text of the license.

Citing this work

If you find that this R package is useful for your work, please cite our paper which is out on BMC Bioinformatics:

Dey, K.K., Xie, D. and Stephens, M., 2018. A new sequence logo plot to highlight enrichment and depletion. BMC Bioinformatics. 19:473 https://doi.org/10.1186/s12859-018-2489-3.

Quick Start

The most recent version of Logolas is available from Github using devtools R package.First, you would require to install the following Bioconductor packages.

source("https://bioconductor.org/biocLite.R")
biocLite(c("Biostrings","BiocStyle","Biobase","seqLogo","ggseqlogo"))

Then install Logolas as follows

library(devtools)
install_github("kkdey/Logolas",build_vignettes = TRUE)

Once you have installed the package, load the package in R by entering

library(Logolas)

To get an overview of the package, enter

help(package = "Logolas")

Next, try creating a few plots using the logomaker function:

Create a standard Logo plot in Logolas, analogous to seqLogo and ggseqLogo R packages.

sequence <- c("CTATTGT","CTCTTAT","CTATTAA","CTATTTA", "CTATTAT","CTTGAAT",
              "CTTAGAT","CTATTAA","CTATTTA","CTATTAT", "CTTTTAT","CTATAGT",
              "CTATTTT","CTTATAT","CTATATT","CTCATTT", "CTTATTT","CAATAGT",
              "CATTTGA","CTCTTAT","CTATTAT","CTTTTAT", "CTATAAT","CTTAGGT",
              "CTATTGT","CTCATGT","CTATAGT", "CTCGTTA","CTAGAAT","CAATGGT")
logomaker(sequence,type = "Logo")

misc

The corresponding EDLogo plot highlights the depletion of T in the middle, not visually clear in the standard logo plot.

logomaker(sequence, type = "EDLogo")

misc

One can also apply EDLogo for amino acid motifs, marked by alphabets beyond A, C, G and T as in DNA motifs.

We create an EDLogo plot on the amino acid sequences at N-Glycosylation sites, with a user specified background bg chosen to be the median psoitional weight of an aminoa acid in the context around the glycosylation site [data from Uniprotkb].

data("N_Glycosyl_sequences")
bg <- apply(N_Glycosyl_sequences, 1, function(x) return(median(x)))
bg <- bg/sum(bg)
logomaker(N_Glycosyl_sequences, type = "EDLogo", bg=bg)

misc

EDLogo highlights the motif Asn (N) -X- Ser (S)/Thr (T) -X motif at the center where X is depleted for the amino acid Pro (P).

Logolas allows the symbols in the logo plot to be a combination of strings and charcaters or be purely strings - examples of which are shown below

For a mutation signature (mismatch type at the center with flanking bases) example (data from Shiraishi et al 2015).

data(mutation_sig)
logomaker(mutation_sig, type = "EDLogo", color_type = "per_symbol",  color_seed = 2000)

misc

EDLogo plot for the enrichment and depletion of histone marks in different parts of the genome (data from Koch et al 2007).

data(histone_marks)
logomaker(histone_marks$mat, bg = histone_marks$bgmat, type = "EDLogo")

misc

Finally, please walk through some more detailed examples in the vignette:

vignette("Logolas")

Developer notes

This was the R command used to generate the vignette PDF file from the R Markdown source:

render("Logolas.Rmd",output_format="pdf_document")

Credits

This software was developed by Kushal Dey, Dongyue Xie and Matthew Stephens at the University of Chicago. For any questions or comments, please contact Kushal Dey at [email protected].

The authors would like to acknowledge Oliver Bembom, the author of the seqLogo package which acted as an inspiration and starting point for this software. The authors also thank Peter Carbonetto, Edward Wallace and John Blischak for helpful discussions and feedback.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].