PdpipeEasy pipelines for pandas DataFrames.
Stars: ✭ 590 (+697.3%)
Dataframe GoDataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
Stars: ✭ 487 (+558.11%)
PanderaA light-weight, flexible, and expressive pandas data validation library
Stars: ✭ 506 (+583.78%)
Spacy Models💫 Models for the spaCy Natural Language Processing (NLP) library
Stars: ✭ 796 (+975.68%)
SequoiaA股自动选股程序,实现了海龟交易法则、缠中说禅牛市买点,以及其他若干种技术形态
Stars: ✭ 564 (+662.16%)
Spacy Transformers🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Stars: ✭ 919 (+1141.89%)
Subreddit AnalyzerA comprehensive Data and Text Mining workflow for submissions and comments from any given public subreddit.
Stars: ✭ 447 (+504.05%)
Dataframe JsA javascript library providing a new data structure for datascientists and developpers
Stars: ✭ 376 (+408.11%)
BiopandasWorking with molecular structures in pandas DataFrames
Stars: ✭ 329 (+344.59%)
DatasheetsRead data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (+701.35%)
S3bpRead and write Python objects to S3, caching them on your hard drive to avoid unnecessary IO.
Stars: ✭ 24 (-67.57%)
Spark DariaEssential Spark extensions and helper methods ✨😲
Stars: ✭ 553 (+647.3%)
BevelOrdinal regression in Python
Stars: ✭ 41 (-44.59%)
Mexican Government ReportText Mining on the 2019 Mexican Government Report, covering from extracting text from a PDF file to plotting the results.
Stars: ✭ 473 (+539.19%)
FoxcrossAsyncIO serving for data science models
Stars: ✭ 18 (-75.68%)
Projects🪐 End-to-end NLP workflows from prototype to production
Stars: ✭ 397 (+436.49%)
W.i.l.lA python written personal assistant
Stars: ✭ 377 (+409.46%)
VaexOut-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀
Stars: ✭ 6,793 (+9079.73%)
OptopsyA nimble options backtesting library for Python
Stars: ✭ 373 (+404.05%)
SparkmagicJupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+1189.19%)
CamphrspaCy plugin for Transformers , Udify, ELmo, etc.
Stars: ✭ 327 (+341.89%)
PyjanitorClean APIs for data cleaning. Python implementation of R package Janitor
Stars: ✭ 647 (+774.32%)
EvidentlyInteractive reports to analyze machine learning models during validation or production monitoring.
Stars: ✭ 304 (+310.81%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+1155.41%)
MordecaiFull text geoparsing as a Python library
Stars: ✭ 579 (+682.43%)
KlayersPython Packages as AWS Lambda Layers
Stars: ✭ 557 (+652.7%)
BoltzmanncleanFill missing values in Pandas DataFrames using Restricted Boltzmann Machines
Stars: ✭ 23 (-68.92%)
Spacy Stanza💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
Stars: ✭ 508 (+586.49%)
Dragonfirethe open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (+1413.51%)
PandasvaultAdvanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).
Stars: ✭ 316 (+327.03%)
QuickvizVisualize a pandas dataframe in a few clicks
Stars: ✭ 18 (-75.68%)
Spacy💫 Industrial-strength Natural Language Processing (NLP) in Python
Stars: ✭ 21,978 (+29600%)
Lambda PacksPrecompiled packages for AWS Lambda
Stars: ✭ 997 (+1247.3%)
Pytablewriterpytablewriter is a Python library to write a table in various formats: CSV / Elasticsearch / HTML / JavaScript / JSON / LaTeX / LDJSON / LTSV / Markdown / MediaWiki / NumPy / Excel / Pandas / Python / reStructuredText / SQLite / TOML / TSV.
Stars: ✭ 422 (+470.27%)
DataframeC++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Stars: ✭ 828 (+1018.92%)
ArqueroQuery processing and transformation of array-backed data tables.
Stars: ✭ 384 (+418.92%)
Sense2vec🦆 Contextually-keyed word vectors
Stars: ✭ 1,184 (+1500%)
PandastableTable analysis in Tkinter using pandas DataFrames.
Stars: ✭ 376 (+408.11%)
Spark RedisA connector for Spark that allows reading and writing to/from Redis cluster
Stars: ✭ 773 (+944.59%)
PrettypandasA Pandas Styler class for making beautiful tables
Stars: ✭ 376 (+408.11%)
Pandas TaTechnical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 130+ Indicators
Stars: ✭ 962 (+1200%)
ModinModin: Speed up your Pandas workflows by changing a single line of code
Stars: ✭ 6,639 (+8871.62%)
Spacy Streamlit👑 spaCy building blocks and visualizers for Streamlit apps
Stars: ✭ 360 (+386.49%)
PyinflectA python module for word inflections designed for use with spaCy.
Stars: ✭ 52 (-29.73%)
Adam qasADAM - A Question Answering System. Inspired from IBM Watson
Stars: ✭ 330 (+345.95%)
CltkThe Classical Language Toolkit
Stars: ✭ 650 (+778.38%)
PystoreFast data store for Pandas time-series data
Stars: ✭ 325 (+339.19%)
Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+11155.41%)
DatafusionDataFusion has now been donated to the Apache Arrow project
Stars: ✭ 611 (+725.68%)
Python nlp tutorialThis repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
Stars: ✭ 72 (-2.7%)
Text Analytics With PythonLearn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (+1429.73%)
Spacy Lookups Data📂 Additional lookup tables and data resources for spaCy
Stars: ✭ 48 (-35.14%)
ScispacyA full spaCy pipeline and models for scientific/biomedical documents.
Stars: ✭ 855 (+1055.41%)
SmileStatistical Machine Intelligence & Learning Engine
Stars: ✭ 5,412 (+7213.51%)