All Projects → HelikarLab → candis

HelikarLab / candis

Licence: GPL-3.0 license
🎀 A data mining suite for gene expression data.

Programming Languages

javascript
184084 projects - #8 most used programming language
python
139335 projects - #7 most used programming language
Jupyter Notebook
11667 projects
Dockerfile
14818 projects
shell
77523 projects
CSS
56736 projects

Projects that are alternatives of or similar to candis

LeetCode
At present contains scraped data from around 1500 problems present on the site. More to follow....
Stars: ✭ 45 (+60.71%)
Mutual labels:  data-mining
TIGER
Python toolbox to evaluate graph vulnerability and robustness (CIKM 2021)
Stars: ✭ 103 (+267.86%)
Mutual labels:  data-mining
FEATHER
The reference implementation of FEATHER from the CIKM '20 paper "Characteristic Functions on Graphs: Birds of a Feather, from Statistical Descriptors to Parametric Models".
Stars: ✭ 34 (+21.43%)
Mutual labels:  data-mining
old nesta daps
[archived]
Stars: ✭ 16 (-42.86%)
Mutual labels:  data-mining
arrhythmia-ecg-analysis-ai
Arrhythmia Classification through Characteristics Extraction with Discrete Wavelet Transform & WEKA/MATLAB Supervised Training
Stars: ✭ 25 (-10.71%)
Mutual labels:  weka
2018-JData-Unicom-RiskUser
2018-JData-联通-基于移动网络通讯行为的风险用户识别:Baseline 0.77
Stars: ✭ 20 (-28.57%)
Mutual labels:  data-mining
PracticalMachineLearning
A collection of ML related stuff including notebooks, codes and a curated list of various useful resources such as books and softwares. Almost everything mentioned here is free (as speech not free food) or open-source.
Stars: ✭ 60 (+114.29%)
Mutual labels:  data-mining
XCloud
Official Code for Paper <XCloud: Design and Implementation of AI Cloud Platform with RESTful API Service> (arXiv1912.10344)
Stars: ✭ 58 (+107.14%)
Mutual labels:  data-mining
iww
AI based web-wrapper for web-content-extraction
Stars: ✭ 61 (+117.86%)
Mutual labels:  data-mining
HFT-Prediction
Machine learning approach to high frequency trading, MLP & RNN used
Stars: ✭ 19 (-32.14%)
Mutual labels:  data-mining
TeBaQA
A question answering system which utilises machine learning.
Stars: ✭ 17 (-39.29%)
Mutual labels:  weka
python-notebooks
A collection of Jupyter Notebooks used in conferences or just to have some snippets.
Stars: ✭ 14 (-50%)
Mutual labels:  data-mining
The-Purchase-and-Redemption-Forecast-Challenge-baseline
天池“资金流入流出预测——挑战baseline”的解决方案,线上效果143.5
Stars: ✭ 78 (+178.57%)
Mutual labels:  data-mining
heidi
heidi : tidy data in Haskell
Stars: ✭ 24 (-14.29%)
Mutual labels:  data-mining
tf-idf-python
Term frequency–inverse document frequency for Chinese novel/documents implemented in python.
Stars: ✭ 98 (+250%)
Mutual labels:  data-mining
neuromantic
Latest Data Science Materials
Stars: ✭ 27 (-3.57%)
Mutual labels:  data-mining
dee2
Digital Expression Explorer 2 (DEE2): a repository of uniformly processed RNA-seq data
Stars: ✭ 32 (+14.29%)
Mutual labels:  data-mining
chainRec
Mengting Wan, Julian McAuley, "Item Recommendation on Monotonic Behavior Chains", in Proc. of 2018 ACM Conference on Recommender Systems (RecSys'18), Vancouver, Canada, Oct. 2018.
Stars: ✭ 52 (+85.71%)
Mutual labels:  data-mining
Loan-Approval-Prediction
Loan Application Data Analysis
Stars: ✭ 61 (+117.86%)
Mutual labels:  data-mining
tree-hugger
A light-weight, extendable, high level, universal code parser built on top of tree-sitter
Stars: ✭ 96 (+242.86%)
Mutual labels:  data-mining

A data mining suite for gene expression data

Candis is an open source data mining suite (released under the GNU General Public License v3) for gene expression data that consists of a wide collection of tools you require, right from Data Extraction to Model Deployment. candis is built on top of the toolkit - CancerDiscover written by the bioinformaticians at HelikarLab.

Citation: If you use candis please cite our work
Mohammed, A., Biegert, G., Adamec, J., & Helikar, T. (2017). Identification of potential tissue-specific cancer biomarkers and development of cancer versus normal genomic classifiers. Oncotarget, 8(49), 85692-85715. https://doi.org/10.18632/oncotarget.21127

Or

Mohammed, A., Biegert, G., Adamec, J., & Helikar, T. (2018). CancerDiscover: An integrative pipeline for cancer biomarker and cancer class prediction from high-throughput sequencing data. Oncotarget, 9(2), 2565-2573. https://doi.org/10.18632/oncotarget.23511

WARNING: candis currently is still in dev mode and not production-ready yet. In case if you run across bugs or errors, raise an issue over here.

Table of Contents

Installation

Assuming you've installed dependencies, simply

$ pip install candis

TL;DR

$ curl -sL git.io/install-candis | python # with dependencies

... and launch candis's development server:

$ candis

To install candis right from scratch, check out our exhaustive guides:

Docker Image

You can also attempt to install candis via Docker as follows:

$ docker pull helikarlab/candis

... and simply run the image optionally mapping the port 5000.

$ docker run -p 8888:5000 helikarlab/candis

OR

After cloning the repository, build from the updated Dockerfile and docker-compose.yml:

For development:

$ ./manage up -d --build

For production:

$ CANDIS_ENVIRONMENT=production ./manage up -d --build

Then go to localhost:5000 in your browser to open the app.

Other Commands:

$ ./manage [service] [command]

$ ./manage db backup			 		# Backup the database
$ ./manage db restore /path/to/backup	# Restore a snapshot
$ ./manage db backups 				 	# List all backups

Usage

Launching the RIA (Rich Internet Application)

via CLI

$ candis

OR

$ python -m candis

via Python

>>> import candis
>>> candis.main()

Using the CLI (Command Line Interface)

$ candis --cdata path/to/data.cdata --config path/to/config.json

Using the Jupyter Notebook from inside the docker container

  • Starting the jupyter notebook server inside the candis app container
$ docker-compose exec app jupyter notebook --ip 0.0.0.0 --no-browser --allow-root

Features

  • Converting a CDATA to an ARFF file

     >>> import candis
     >>> cdata = candis.cdata.read('path/to/data.cdata')

    Then, simply use the CData.toARFF API:

     >>> cdata.toARFF('path/to/data.arff')
  • Running a Pipeline.

     >>> pipe = candis.Pipeline()
     >>> pipe.run(cdata)
     >>> while pipe.status == candis.Pipeline.RUNNING:
     ...     # do something while pipeline is running

Dependencies

  • Production Dependencies
    • R
    • WEKA (NOTE: Requires Java)
    • Python 3.6+ and PIP (Python's Package Manager)
    • NumPy
  • Development Dependencies

Team


Dr. Tomas Helikar
[email protected]

Principal Investigator


Dr. Akram Mohammed
[email protected]

Author and Maintainer


Achilles Rasquinha
[email protected]

Author and Maintainer


Rupav Jain
[email protected]

Author and Maintainer

License

This software has been released under the GNU General Public License v3.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].