openrefine-batchShell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that communicates with the OpenRefine API.
Stars: ✭ 76 (+13.43%)
openrefine-dockerOpenRefine is a free, open source power tool for working with messy data and improving it. This repository contains Dockerbuild files for automated builds.
Stars: ✭ 19 (-71.64%)
Mara Example Project 2An example mini data warehouse for python project stats, template for new projects
Stars: ✭ 154 (+129.85%)
ElyraElyra extends JupyterLab Notebooks with an AI centric approach.
Stars: ✭ 839 (+1152.24%)
naas⚙️ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (+226.87%)
pyHeadspacecommand-line script to download headspace packs, singles, everyday meditation and other sessions. You could also download all packs at once
Stars: ✭ 37 (-44.78%)
tchambaTchamba.random, is a real random data genarator (letters, jokes, names...)
Stars: ✭ 11 (-83.58%)
judgeA blazingly fast online judge/ autograder ⚖️ built with Python and the Django framework to test cases against your solution. Check out the sponsor links and help fund DomeCode.
Stars: ✭ 30 (-55.22%)
terminalplotNo description or website provided.
Stars: ✭ 40 (-40.3%)
es2postgresElasticSearch to PostgreSQL loader
Stars: ✭ 18 (-73.13%)
gallia-coreA schema-aware Scala library for data transformation
Stars: ✭ 44 (-34.33%)
Library-Search-Plugin-PublicThe Library Search Plugin plugin allows users (students, researchers, etc.) to search your library's catalogue, Google Scholar, WorldCat, or PubMed, without having to navigate to the respective websites first! It also comes with a neat context menu that allows users to select text, right-click, and search!
Stars: ✭ 17 (-74.63%)
conciliatorOpenRefine reconciliation services for VIAF, ORCID, and Open Library + framework for creating more.
Stars: ✭ 95 (+41.79%)
maxwell-sinkconsume maxwell generated message from kafka,export it to another mysql.
Stars: ✭ 16 (-76.12%)
malossTowards Measuring Supply Chain Attacks on Package Managers for Interpreted Languages
Stars: ✭ 46 (-31.34%)
pyfoobarPython project template/scaffold and best practices
Stars: ✭ 31 (-53.73%)
ISS InfoPython wrapper for tracking information about International Space Station via http://open-notify.org
Stars: ✭ 12 (-82.09%)
persistityA persistence framework for game developers
Stars: ✭ 34 (-49.25%)
proxpiPyPI caching mirror
Stars: ✭ 19 (-71.64%)
HABAppEasy home automation with MQTT and/or openHAB
Stars: ✭ 35 (-47.76%)
sparklanesA lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-74.63%)
twitivity🐍 Twitter Accounts Activity API Client Library for Python
Stars: ✭ 49 (-26.87%)
ertis-authGeneric token generator and validator service like auth
Stars: ✭ 28 (-58.21%)
rdkit-pypi⚛️ RDKit Python Wheels on PyPi. 💻 pip install rdkit-pypi
Stars: ✭ 62 (-7.46%)
versatile-data-kitVersatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (+114.93%)
lineageGenerate beautiful documentation for your data pipelines in markdown format
Stars: ✭ 16 (-76.12%)
kozaData transformation framework for LinkML data models
Stars: ✭ 21 (-68.66%)
Bifrost基于JSON PRC 协议的一种Android跨进程调用解决方案。
Stars: ✭ 24 (-64.18%)
Binder🦁"Hello World" <-> [🏷, 🏷, 🏷, 🏷]
Stars: ✭ 37 (-44.78%)
slamdunkStreamlining SLAM-seq analysis with ultra-high sensitivity
Stars: ✭ 24 (-64.18%)
acesoPython package to calculate 2SFCA and other measures of spatial accessibility
Stars: ✭ 20 (-70.15%)
querycontactsQuery network abuse contacts for a given ip address on abuse-contacts.abusix.zone
Stars: ✭ 13 (-80.6%)
ProxyGrabAsynchronous Library made using Python and aiohttp to get proxies from multiple services!
Stars: ✭ 17 (-74.63%)
dswarman open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)
Stars: ✭ 57 (-14.93%)
TEAMThe Taxonomy for ETL Automation Metadata (TEAM) is a metadata management tool for data warehouse automation. It is part of the ecosystem for data warehouse automation, alongside the Virtual Data Warehouse pattern manager and the generic schema for Data Warehouse Automation.
Stars: ✭ 27 (-59.7%)
python-for-excelThis is the companion repo of the O'Reilly book "Python for Excel".
Stars: ✭ 253 (+277.61%)
DataXServer为DataX(https://github.com/alibaba/DataX) 提供远程多语言调用(ThriftServer,HttpServer) 分布式运行(DataX on YARN) 功能
Stars: ✭ 130 (+94.03%)
oesophagusEnterprise Grade Single-Step Streaming Data Infrastructure Setup. (Under Development)
Stars: ✭ 12 (-82.09%)
rfc-bibtexA command line tool that creates bibtex entries for IETF RFCs and Internet Drafts.
Stars: ✭ 43 (-35.82%)
dflibIn-memory Java DataFrame library
Stars: ✭ 50 (-25.37%)
wheelodexAn index of wheels
Stars: ✭ 20 (-70.15%)
pypi-toolsCommand-line Python scripts to do things with PyPI
Stars: ✭ 18 (-73.13%)
VestaboardAn API Wrapper for Vestaboards written in Python
Stars: ✭ 23 (-65.67%)
mydataharbor🇨🇳 MyDataHarbor是一个致力于解决任意数据源到任意数据源的分布式、高扩展性、高性能、事务级的数据同步中间件。帮助用户可靠、快速、稳定的对海量数据进行准实时增量同步或者定时全量同步,主要定位是为实时交易系统服务,亦可用于大数据的数据同步(ETL领域)。
Stars: ✭ 28 (-58.21%)
PDAP-ScrapersCode relating to scraping public police data.
Stars: ✭ 72 (+7.46%)
flytekitExtensible Python SDK for developing Flyte tasks and workflows. Simple to get started and learn and highly extensible.
Stars: ✭ 82 (+22.39%)
scholiaWikidata-based scholarly profiles
Stars: ✭ 166 (+147.76%)
DataBridge.NETConfigurable data bridge for permanent ETL jobs
Stars: ✭ 16 (-76.12%)