tieba-zhuaqu百度贴吧分布式爬虫,用于贴吧数据挖掘。从贴吧维度和用户维度进行数据分析
Stars: ✭ 56 (+12%)
LeTourDataSetEvery cyclist and stage of the Tour de France in two CSV files.
Stars: ✭ 61 (+22%)
Product-Categorization-NLPMulti-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (-40%)
open-diggerOpen source analysis tools
Stars: ✭ 193 (+286%)
advanced-pandasPandas is a powerful tool for data exploration and analysis (including timeseries).
Stars: ✭ 22 (-56%)
genieGenie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
Stars: ✭ 21 (-58%)
gocellsEvent Based Applications [DEPRECATED]
Stars: ✭ 69 (+38%)
RepSePReproducible Self-Publishing - Demo Publications in the Most Common Formats
Stars: ✭ 14 (-72%)
vinumVinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.
Stars: ✭ 57 (+14%)
ospiOpen Source Presence Infographic of Indian Startups
Stars: ✭ 25 (-50%)
hotmapWebGL Heatmap Viewer for Big Data and Bioinformatics
Stars: ✭ 13 (-74%)
facerec-bias-bfwSource code and notebooks to reproduce experiments and benchmarks on Bias Faces in the Wild (BFW).
Stars: ✭ 40 (-20%)
yt-channels-DS-AI-ML-CSA comprehensive list of 180+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.
Stars: ✭ 1,038 (+1976%)
meta-csvA Clojure smart reader for CSV files
Stars: ✭ 20 (-60%)
akshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 5,155 (+10210%)
Fraud-Detection-in-Online-TransactionsDetecting Frauds in Online Transactions using Anamoly Detection Techniques Such as Over Sampling and Under-Sampling as the ratio of Frauds is less than 0.00005 thus, simply applying Classification Algorithm may result in Overfitting
Stars: ✭ 41 (-18%)
pyglotaranA Python library for Global and Target Analysis of time-resolved spectroscopy data
Stars: ✭ 33 (-34%)
PSelectPowerShell DSL for aggregating data
Stars: ✭ 27 (-46%)
mixedvinesPython package for canonical vine copula trees with mixed continuous and discrete marginals
Stars: ✭ 36 (-28%)
stats📈 Useful notes and personal collections on statistics.
Stars: ✭ 16 (-68%)
golearn🔥 Golang basics and actual-combat (including: crawler, distributed-systems, data-analysis, redis, etcd, raft, crontab-task)
Stars: ✭ 36 (-28%)
covid-19COVID-19 World is yet another Project to build a Dashboard like app to showcase the data related to the COVID-19(Corona Virus).
Stars: ✭ 28 (-44%)
8-Week-SQL-ChallengeCase study solutions for #8WeekSQLChallenge at https://8weeksqlchallenge.com
Stars: ✭ 43 (-14%)
datatileA library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (+738%)
covidvizProfessional visualizations of COVID-19, emulating NYT, The Guardian, Washington Post, The Economist & others, using only Python & Altair.
Stars: ✭ 24 (-52%)
genieclustGenie++ Fast and Robust Hierarchical Clustering with Noise Point Detection - for Python and R
Stars: ✭ 34 (-32%)
dataViz CADiMaterials for the "Data Visualization" CADi workshop @ "Tecnológico de Monterrey"
Stars: ✭ 14 (-72%)
dflibIn-memory Java DataFrame library
Stars: ✭ 50 (+0%)
taucmdrPerformance engineering for the rest of us.
Stars: ✭ 26 (-48%)
FDBeyeR tools for eyetracker workflows.
Stars: ✭ 101 (+102%)
Naive-Resume-MatchingText Similarity Applied to resume, to compare Resumes with Job Descriptions and create a score to rank them. Similar to an ATS.
Stars: ✭ 27 (-46%)
metrics📈 What to measure, how to measure it.
Stars: ✭ 14 (-72%)
EEGEduInteractive Brain Playground - Browser based tutorials on EEG with webbluetooth and muse
Stars: ✭ 91 (+82%)
ipython-notebooksA collection of Jupyter notebooks exploring different datasets.
Stars: ✭ 43 (-14%)
computational-neuroscienceShort undergraduate course taught at University of Pennsylvania on computational and theoretical neuroscience. Provides an introduction to programming in MATLAB, single-neuron models, ion channel models, basic neural networks, and neural decoding.
Stars: ✭ 36 (-28%)
online-course-recommendation-systemBuilt on data from Pluralsight's course API fetched results. Works with model trained with K-means unsupervised clustering algorithm.
Stars: ✭ 31 (-38%)
PythonTipsDSPython Tips for Data Scientist
Stars: ✭ 23 (-54%)
elucidateconvenience functions to help researchers elucidate patterns in their data
Stars: ✭ 26 (-48%)
dnotebookDnotebook is a Jupyter-like library for javaScript environment. It allows you to create and share pages that contain live code, text and visualizations.
Stars: ✭ 109 (+118%)
iMOKAinteractive Multi Objective K-mer Analysis
Stars: ✭ 19 (-62%)
uetaiCustom ML tracking experiment and debugging tools.
Stars: ✭ 17 (-66%)
MooseMOOSE - Platform for software and data analysis.
Stars: ✭ 110 (+120%)
XrayDBX-ray Reference Data in SQLite library, including Python interface
Stars: ✭ 26 (-48%)
taller SparkRTaller SparkR para las Jornadas de Usuarios de R
Stars: ✭ 12 (-76%)
Dominando-PandasEste repositório está destinado ao processo de aprendizagem da biblioteca Pandas.
Stars: ✭ 22 (-56%)
AlphaPlot📈 Application for statistical analysis and data visualization which can generate different types of publication quality 2D and 3D plots with extensive visual customization.
Stars: ✭ 140 (+180%)
Data-VisualizationCollection of interactive Jupiter Notebook widgets and graphs.
Stars: ✭ 112 (+124%)
re-datare_data - fix data issues before your users & CEO would discover them 😊
Stars: ✭ 955 (+1810%)
ggshakeRAn analysis and visualization R package that works with publicly available soccer data
Stars: ✭ 69 (+38%)