bigquery-data-lineageReference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
Stars: ✭ 112 (+433.33%)
dataflow-contact-center-speech-analysisSpeech Analysis Framework, a collection of components and code from Google Cloud that you can use to transcribe audio files to create analytics.
Stars: ✭ 46 (+119.05%)
TweebankNLP[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset
Stars: ✭ 84 (+300%)
wink-tokenizerMultilingual tokenizer that automatically tags each token with its type
Stars: ✭ 51 (+142.86%)
tkseemArabic Tokenization Library. It provides many tokenization algorithms.
Stars: ✭ 45 (+114.29%)
bigflowA Python framework for data processing on GCP.
Stars: ✭ 96 (+357.14%)
lunasecLunaSec - Dependency Security Scanner that automatically notifies you about vulnerabilities like Log4Shell or node-ipc in your Pull Requests and Builds. Protect yourself in 30 seconds with the LunaTrace GitHub App: https://github.com/marketplace/lunatrace-by-lunasec/
Stars: ✭ 1,261 (+5904.76%)
MeemooappCreative apps to use, build, share, and hack in the browser.
Stars: ✭ 220 (+947.62%)
gotchaGo Taint CHeck Analyser
Stars: ✭ 40 (+90.48%)
Azure Services MapA visual representation and reference to Azure services
Stars: ✭ 189 (+800%)
DataflowTemplatesConvenient Dataflow pipelines for transforming data between cloud data sources
Stars: ✭ 22 (+4.76%)
nightfall dlp actionGitHub Data Loss Prevention (DLP) Action: Scan Pull Requests for sensitive data, like credentials & secrets, PII, credit card numbers, and more.
Stars: ✭ 46 (+119.05%)
xontrib-output-searchGet identifiers, paths, URLs and words from the previous command output and use them for the next command in xonsh shell.
Stars: ✭ 26 (+23.81%)
simplemmaSimple multilingual lemmatizer for Python, especially useful for speed and efficiency
Stars: ✭ 32 (+52.38%)
github-watchmanMonitoring GitHub for sensitive data shared publicly
Stars: ✭ 60 (+185.71%)
limaThe Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.
Stars: ✭ 75 (+257.14%)
data-lineageGenerate and Visualize Data Lineage from query history
Stars: ✭ 166 (+690.48%)
ChigraphA visual systems language for beginners compiled using LLVM
Stars: ✭ 247 (+1076.19%)
joernOpen-source code analysis platform for C/C++/Java/Binary/Javascript/Python/Kotlin based on code property graphs
Stars: ✭ 968 (+4509.52%)
MicrofloLive dataflow programming for microcontrollers and embedded
Stars: ✭ 207 (+885.71%)
polycashThe ultimate open source betting protocol. PolyCash is a P2P blockchain platform for wallets, asset issuance, bonds & gaming.
Stars: ✭ 24 (+14.29%)
PytA Static Analysis Tool for Detecting Security Vulnerabilities in Python Web Applications
Stars: ✭ 2,061 (+9714.29%)
deduceDeduce: de-identification method for Dutch medical text
Stars: ✭ 40 (+90.48%)
Blocks.jsJavaScript dataflow graph editor
Stars: ✭ 165 (+685.71%)
datacatalog-tag-managerPython package to manage Google Cloud Data Catalog tags, loading metadata from external sources -- currently supports the CSV file format
Stars: ✭ 17 (-19.05%)
Data-StashData-Stash是基于FISCO-BCOS的数据仓库组件,通过解析节点的binlog日志,生成该节点状态的全量备份,从而使节点能够实现冷热数据分离和数据裁剪。
Stars: ✭ 27 (+28.57%)
nlp-cheat-sheet-pythonNLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition
Stars: ✭ 69 (+228.57%)
FATFactom Asset Tokens - Open tokenization standards on Factom
Stars: ✭ 17 (-19.05%)
yarrYer another array library
Stars: ✭ 42 (+100%)
wb-toolboxSimulink toolbox to rapidly prototype robot controllers
Stars: ✭ 20 (-4.76%)
PothosCommsCommunications blocks and support libraries
Stars: ✭ 15 (-28.57%)
ObservableComputationsCross-platform .NET library for computations whose arguments and results are objects that implement INotifyPropertyChanged and INotifyCollectionChanged (ObservableCollection) interfaces.
Stars: ✭ 94 (+347.62%)
sqllineageSQL Lineage Analysis Tool powered by Python
Stars: ✭ 348 (+1557.14%)
EmbbEmbedded Multicore Building Blocks (EMB²): Library for parallel programming of embedded systems. Star us on GitHub? +1
Stars: ✭ 153 (+628.57%)
re-viewTools for building reactive user interfaces in ClojureScript.
Stars: ✭ 40 (+90.48%)
flowgraphFlowgraph package for scalable asynchronous system development
Stars: ✭ 51 (+142.86%)
Vaaku2VecLanguage Modeling and Text Classification in Malayalam Language using ULMFiT
Stars: ✭ 68 (+223.81%)
actACT hardware description language and core tools.
Stars: ✭ 53 (+152.38%)
Dnai.EditorDnai Editor - Visual Scripting (Node Editor)
Stars: ✭ 117 (+457.14%)
whoshiringA browser for Hacker News's Ask HN: Who's Hiring, with Matrix Inside(tm)
Stars: ✭ 24 (+14.29%)
PothoscoreThe Pothos data-flow framework
Stars: ✭ 232 (+1004.76%)
Pythonflow🐍 Dataflow programming for python.
Stars: ✭ 215 (+923.81%)
spacy-server🦜 Containerized HTTP API for industrial-strength NLP via spaCy and sense2vec
Stars: ✭ 58 (+176.19%)
Vue BlocksVue2 dataflow graph editor
Stars: ✭ 201 (+857.14%)
ScioA Scala API for Apache Beam and Google Cloud Dataflow.
Stars: ✭ 2,247 (+10600%)
Data-ExportData-Export支持将链上数据导出到MySQL、ES等便于进行大数据处理的存储介质中,解决区块链数据复杂查询、分析、可视化和处理的问题。
Stars: ✭ 37 (+76.19%)
NipyapiA convenient Python wrapper for Apache NiFi
Stars: ✭ 169 (+704.76%)
lingNatural Language Processing Toolkit in Golang
Stars: ✭ 57 (+171.43%)
pyroclasticFunctional dataflow through composable computations
Stars: ✭ 17 (-19.05%)
dspatchThe Refreshingly Simple Cross-Platform C++ Dataflow / Pipelining / Stream Processing / Reactive Programming Framework
Stars: ✭ 124 (+490.48%)
PothosDemosPothos demonstration applications
Stars: ✭ 24 (+14.29%)
dtaskDTask is a scheduler for statically dependent tasks.
Stars: ✭ 17 (-19.05%)
Data-ReconcileData-Reconcile是一款基于区块链的对账组件,提供基于区块链智能合约账本的通用化数据对账解决方案,并提供了一套可动态扩展的对账框架,支持定制化开发。
Stars: ✭ 24 (+14.29%)