CbrainCBRAIN is a flexible Ruby on Rails framework for accessing and processing of large data on high-performance computing infrastructures.
Stars: ✭ 51 (+240%)
MdsplusThe MDSplus data management system
Stars: ✭ 47 (+213.33%)
TdmR package for normalizing RNA-seq data to make them comparable to microarray data.
Stars: ✭ 33 (+120%)
Data Science On GcpSource code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+5660%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+5593.33%)
Texar PytorchIntegrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 636 (+4140%)
PanderaA light-weight, flexible, and expressive pandas data validation library
Stars: ✭ 506 (+3273.33%)
Awesome Web ScrapingList of libraries, tools and APIs for web scraping and data processing.
Stars: ✭ 4,510 (+29966.67%)
XidelCommand line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Stars: ✭ 335 (+2133.33%)
Eternal👾~ music, eternal ~ 👾
Stars: ✭ 323 (+2053.33%)
DaliA GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Stars: ✭ 3,624 (+24060%)
NonechucksDeal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
Stars: ✭ 304 (+1926.67%)
RapidtablesSuper fast list of dicts to pre-formatted tables conversion library for Python 2/3
Stars: ✭ 292 (+1846.67%)
HubDataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+26586.67%)
prostoProsto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (+260%)
baleen3Baleen 3 is a data processing tool based on the Annot8 framework
Stars: ✭ 15 (+0%)