tieba-zhuaqu百度贴吧分布式爬虫,用于贴吧数据挖掘。从贴吧维度和用户维度进行数据分析
Stars: ✭ 56 (+64.71%)
elucidateconvenience functions to help researchers elucidate patterns in their data
Stars: ✭ 26 (-23.53%)
python ml tutorialA complete tutorial in python for Data Analysis and Machine Learning
Stars: ✭ 118 (+247.06%)
computational-neuroscienceShort undergraduate course taught at University of Pennsylvania on computational and theoretical neuroscience. Provides an introduction to programming in MATLAB, single-neuron models, ion channel models, basic neural networks, and neural decoding.
Stars: ✭ 36 (+5.88%)
open-diggerOpen source analysis tools
Stars: ✭ 193 (+467.65%)
MooseMOOSE - Platform for software and data analysis.
Stars: ✭ 110 (+223.53%)
vinumVinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.
Stars: ✭ 57 (+67.65%)
dflibIn-memory Java DataFrame library
Stars: ✭ 50 (+47.06%)
ipaddressData analysis of IP addresses and networks
Stars: ✭ 20 (-41.18%)
online-course-recommendation-systemBuilt on data from Pluralsight's course API fetched results. Works with model trained with K-means unsupervised clustering algorithm.
Stars: ✭ 31 (-8.82%)
pyglotaranA Python library for Global and Target Analysis of time-resolved spectroscopy data
Stars: ✭ 33 (-2.94%)
iMOKAinteractive Multi Objective K-mer Analysis
Stars: ✭ 19 (-44.12%)
facerec-bias-bfwSource code and notebooks to reproduce experiments and benchmarks on Bias Faces in the Wild (BFW).
Stars: ✭ 40 (+17.65%)
Infinite Stories with DataThis repo consists of my analysis of random datasets using various statistical and visualization techniques.
Stars: ✭ 21 (-38.24%)
meta-csvA Clojure smart reader for CSV files
Stars: ✭ 20 (-41.18%)
stats📈 Useful notes and personal collections on statistics.
Stars: ✭ 16 (-52.94%)
FDBeyeR tools for eyetracker workflows.
Stars: ✭ 101 (+197.06%)
dsrIntroduction to Data Science with R (2017)
Stars: ✭ 25 (-26.47%)
dask-awkwardNative Dask collection for awkward arrays, and the library to use it.
Stars: ✭ 25 (-26.47%)
LeTourDataSetEvery cyclist and stage of the Tour de France in two CSV files.
Stars: ✭ 61 (+79.41%)
golearn🔥 Golang basics and actual-combat (including: crawler, distributed-systems, data-analysis, redis, etcd, raft, crontab-task)
Stars: ✭ 36 (+5.88%)
Fraud-Detection-in-Online-TransactionsDetecting Frauds in Online Transactions using Anamoly Detection Techniques Such as Over Sampling and Under-Sampling as the ratio of Frauds is less than 0.00005 thus, simply applying Classification Algorithm may result in Overfitting
Stars: ✭ 41 (+20.59%)
ipython-notebooksA collection of Jupyter notebooks exploring different datasets.
Stars: ✭ 43 (+26.47%)
RepSePReproducible Self-Publishing - Demo Publications in the Most Common Formats
Stars: ✭ 14 (-58.82%)
8-Week-SQL-ChallengeCase study solutions for #8WeekSQLChallenge at https://8weeksqlchallenge.com
Stars: ✭ 43 (+26.47%)
Naive-Resume-MatchingText Similarity Applied to resume, to compare Resumes with Job Descriptions and create a score to rank them. Similar to an ATS.
Stars: ✭ 27 (-20.59%)
advanced-pandasPandas is a powerful tool for data exploration and analysis (including timeseries).
Stars: ✭ 22 (-35.29%)
datatileA library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (+1132.35%)
mixedvinesPython package for canonical vine copula trees with mixed continuous and discrete marginals
Stars: ✭ 36 (+5.88%)
PythonTipsDSPython Tips for Data Scientist
Stars: ✭ 23 (-32.35%)
ospiOpen Source Presence Infographic of Indian Startups
Stars: ✭ 25 (-26.47%)
covidvizProfessional visualizations of COVID-19, emulating NYT, The Guardian, Washington Post, The Economist & others, using only Python & Altair.
Stars: ✭ 24 (-29.41%)
Chapter-2Code examples for Chapter 2 of Data Wrangling with JavaScript
Stars: ✭ 16 (-52.94%)
dataViz CADiMaterials for the "Data Visualization" CADi workshop @ "Tecnológico de Monterrey"
Stars: ✭ 14 (-58.82%)
tutorialsShort programming tutorials pertaining to data analysis.
Stars: ✭ 14 (-58.82%)
DataProfilerWhat's in your data? Extract schema, statistics and entities from datasets
Stars: ✭ 843 (+2379.41%)
uetaiCustom ML tracking experiment and debugging tools.
Stars: ✭ 17 (-50%)
ttbbeerAn R Dataset Package for US Beer Statistics From TTB 🍺
Stars: ✭ 23 (-32.35%)
R4EconR Code Examples Multi-dimensional/Panel Data
Stars: ✭ 16 (-52.94%)
Product-Categorization-NLPMulti-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (-11.76%)
ipychartThe power of Chart.js with Python
Stars: ✭ 48 (+41.18%)
taller SparkRTaller SparkR para las Jornadas de Usuarios de R
Stars: ✭ 12 (-64.71%)
antzANTz immersive 3D data visualization engine
Stars: ✭ 25 (-26.47%)
akshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 5,155 (+15061.76%)
genieGenie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
Stars: ✭ 21 (-38.24%)
MSG-Book📖 现代统计图形(已由人民邮电出版社出版) Modern Statistical Graphics
Stars: ✭ 95 (+179.41%)
ggshakeRAn analysis and visualization R package that works with publicly available soccer data
Stars: ✭ 69 (+102.94%)
metrics📈 What to measure, how to measure it.
Stars: ✭ 14 (-58.82%)