Texar PytorchIntegrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 636 (+127.96%)
PanderaA light-weight, flexible, and expressive pandas data validation library
Stars: ✭ 506 (+81.36%)
Awesome Web ScrapingList of libraries, tools and APIs for web scraping and data processing.
Stars: ✭ 4,510 (+1516.49%)
XidelCommand line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Stars: ✭ 335 (+20.07%)
Eternal👾~ music, eternal ~ 👾
Stars: ✭ 323 (+15.77%)
DaliA GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Stars: ✭ 3,624 (+1198.92%)
NonechucksDeal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
Stars: ✭ 304 (+8.96%)
RapidtablesSuper fast list of dicts to pre-formatted tables conversion library for Python 2/3
Stars: ✭ 292 (+4.66%)
HubDataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+1334.77%)
prostoProsto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (-80.65%)
baleen3Baleen 3 is a data processing tool based on the Annot8 framework
Stars: ✭ 15 (-94.62%)
pulserlApache Pulsar client library for Erlang/Elixir
Stars: ✭ 15 (-94.62%)
alfa♿ Suite of open and standards-based tools for performing reliable accessibility conformance testing at scale
Stars: ✭ 75 (-73.12%)
meta-schemaLittle DSL to make data processing sane with clojure.spec and spec-tools
Stars: ✭ 25 (-91.04%)
pyGAPSA framework for processing adsorption data and isotherm fitting
Stars: ✭ 36 (-87.1%)
bonobo-sqlalchemyPREVIEW - SQL databases in Bonobo, using sqlalchemy
Stars: ✭ 23 (-91.76%)
cqClojure Command-line Data Processor for JSON, YAML, EDN, XML and more
Stars: ✭ 111 (-60.22%)
Speech-RecognitionEnd-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-92.47%)
mech🦾 Main repository for the Mech programming language. Start here!
Stars: ✭ 135 (-51.61%)
tracemlEngine for ML/Data tracking, visualization, dashboards, and model UI for Polyaxon.
Stars: ✭ 445 (+59.5%)
stargateAn Apache Pulsar client written in Elixir
Stars: ✭ 33 (-88.17%)
ECG analysisNo description or website provided.
Stars: ✭ 32 (-88.53%)
parallel-corpora-toolsTools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
Stars: ✭ 35 (-87.46%)
rec-coreData pipelining service
Stars: ✭ 19 (-93.19%)
ProcessorOntology-driven Linked Data processor and server for SPARQL backends. Apache License.
Stars: ✭ 54 (-80.65%)
machine-learning-data-pipelinePipeline module for parallel real-time data processing for machine learning models development and production purposes.
Stars: ✭ 22 (-92.11%)
rsgislibRemote Sensing and GIS Software Library; python module tools for processing spatial data.
Stars: ✭ 103 (-63.08%)
processorA simple and lightweight JavaScript data processing tool. Live demo:
Stars: ✭ 27 (-90.32%)
perkeA keyphrase extractor for Persian
Stars: ✭ 60 (-78.49%)