Sparkora: Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (-89.24%)
agent: Job tracker & performance platform
Stars: ✭ 26 (-94.51%)
Genetics: Genetics (Initialization, Selection, Crossover, Mutation)
Stars: ✭ 15 (-96.84%)
browser-pool: A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Stars: ✭ 71 (-85.02%)
oversmash: Overwatch API library for player details and career stats
Stars: ✭ 42 (-91.14%)
www-react-postgres: A complete template for 2022 focused on React, Postgres and various web3 integrations. You can use the template to make a website, a web application, a hybrid decentralized web application, or even a DAO.
Stars: ✭ 36 (-92.41%)
PopED: Population Experimental Design (PopED) in R
Stars: ✭ 27 (-94.3%)
scavenger: Scrape and take screenshots of dynamic and static webpages
Stars: ✭ 14 (-97.05%)
R-Insee-Data: InseeFr.github.io/R-Insee-Data/
Stars: ✭ 15 (-96.84%)
jupyter-cache: A defined interface for working with a cache of executed jupyter notebooks
Stars: ✭ 28 (-94.09%)
datalake-etl-pipeline: Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-91.77%)
cognipy: In-memory Graph Database and Knowledge Graph with Natural Language Interface, compatible with Pandas
Stars: ✭ 31 (-93.46%)
anovos: Anovos - An Open Source Library for Scalable feature engineering Using Apache-Spark
Stars: ✭ 77 (-83.76%)
lightdash: An open source alternative to Looker built using dbt. Made for analysts ❤️
Stars: ✭ 1,082 (+128.27%)
2017-wmata-ridership-data: Intraday ridership data from Washington Metro Area Transit Authority for 2009 and 2017 inaugurations and the Women's March.
Stars: ✭ 15 (-96.84%)
lego: A lightweight SQL (string) builder using ES6 template strings. Lego embraces SQL instead of adding yet another abstraction layer.
Stars: ✭ 54 (-88.61%)
sheetful: The sheetiest REST API on the block.
Stars: ✭ 65 (-86.29%)
jupyterlab-heroku: JupyterLab extension to deploy applications to Heroku
Stars: ✭ 20 (-95.78%)
TSP-GA: Traveling Salesman Problem Using Parallel Genetic Algorithms
Stars: ✭ 29 (-93.88%)
asyncio-hn: Python (asyncio) wrapper for hackernews api
Stars: ✭ 27 (-94.3%)
Mapeathor: Translator of spreadsheet mappings into R2RML, RML or YARRRML
Stars: ✭ 27 (-94.3%)
opendata: Finland national open data portal (avoindata.fi) source code.
Stars: ✭ 27 (-94.3%)
dbt ml: Package for dbt that allows users to train, audit and use BigQuery ML models.
Stars: ✭ 41 (-91.35%)
scrapy-fieldstats: A Scrapy extension to log items coverage when the spider shuts down
Stars: ✭ 17 (-96.41%)
wetterdienst: Open weather data for humans
Stars: ✭ 190 (-59.92%)
OpenOmics: A bioinformatics API and web-app to integrate multi-omics datasets & interface with public databases.
Stars: ✭ 22 (-95.36%)
systemdspawner: Spawn JupyterHub single-user notebook servers with systemd
Stars: ✭ 79 (-83.33%)
RARBG-scraper: With Selenium headless browsing and CAPTCHA solving
Stars: ✭ 38 (-91.98%)
dre: The project now resides on GitLab
Stars: ✭ 20 (-95.78%)
4cat: The 4CAT Capture and Analysis Toolkit provides modular data capture & analysis for a variety of social media platforms.
Stars: ✭ 144 (-69.62%)
Scrapping: Mastering the art of scraping 🎓
Stars: ✭ 24 (-94.94%)
openkamer: Insight into the Dutch parliament
Stars: ✭ 43 (-90.93%)
Parallel.GAMIT: Python wrapper to parallelize GAMIT executions
Stars: ✭ 22 (-95.36%)
rfisheries: Package for interacting with fisheries databases at openfisheries.org
Stars: ✭ 24 (-94.94%)
phrase-at-scale: Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English
Stars: ✭ 115 (-75.74%)
cbsodata: Unofficial Statistics Netherlands (CBS) opendata API client for Python
Stars: ✭ 32 (-93.25%)
ScrapeBot: A Selenium-driven tool for automated website interaction and scraping.
Stars: ✭ 16 (-96.62%)
pyspark-algorithms: PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (-84.81%)
velos-paris: Summary of bicycle counters in Paris
Stars: ✭ 14 (-97.05%)
Architeuthis: MITM HTTP(S) proxy with integrated load-balancing, rate-limiting and error handling. Built for automated web scraping.
Stars: ✭ 35 (-92.62%)
copycat: A PHP Scraping Class
Stars: ✭ 70 (-85.23%)
gunaydin: Your good mornings ☀️
Stars: ✭ 16 (-96.62%)
bhamtech: A community-curated collection of tech resources, projects, and other related things for Birmingham, AL
Stars: ✭ 23 (-95.15%)
assignPOP: Population Assignment using Genetic, Non-genetic or Integrated Data in a Machine-learning Framework. Methods in Ecology and Evolution. 2018;9:439–446.
Stars: ✭ 16 (-96.62%)
Idra: Idra - Open Data Federation Platform
Stars: ✭ 15 (-96.84%)
rubium: Rubium is a lightweight alternative to Selenium/Capybara/Watir if you need to perform some operations (like web scraping) using Headless Chromium and Ruby
Stars: ✭ 65 (-86.29%)
ipython pytest: Pytest magic for IPython notebooks
Stars: ✭ 33 (-93.04%)
zcrawl: An open source web crawling platform
Stars: ✭ 21 (-95.57%)
Python-Course: 🐍 This is the most complete course in Python, completely practical and all the lessons are explained with examples, so that they can be easily understood. 🍫
Stars: ✭ 18 (-96.2%)