SparkleHaskell on Apache Spark.
Stars: ✭ 419 (+188.97%)
big-data-liteSamples to the Oracle Big Data Lite VM
Stars: ✭ 41 (-71.72%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+488.97%)
v6.dooring.public可视化大屏解决方案, 提供一套可视化编辑引擎, 助力个人或企业轻松定制自己的可视化大屏应用.
Stars: ✭ 323 (+122.76%)
onutonut. A little framework to make games in C++ or JavaScript
Stars: ✭ 40 (-72.41%)
Untech.sharepointUntech.SharePoint - library that will improve your work with Lists in SharePoint (can be used with SSOM and CSOM)
Stars: ✭ 8 (-94.48%)
xunit-ordererImplementation of ITestCaseOrderer enforcing xUnit to run the facts in strict order
Stars: ✭ 15 (-89.66%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+540.69%)
graphiqueGraphQL service for arrow tables and parquet data sets.
Stars: ✭ 28 (-80.69%)
Bigdata File ViewerA cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Stars: ✭ 86 (-40.69%)
Bandar LogMonitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (-86.9%)
NsoupNSoup is a .NET port of the jsoup (http://jsoup.org) HTML parser and sanitizer originally written in Java
Stars: ✭ 145 (+0%)
Datascience Ai Machinelearning ResourcesAlex Castrounis' curated set of resources for artificial intelligence (AI), machine learning, data science, internet of things (IoT), and more.
Stars: ✭ 414 (+185.52%)
OpenHSPHot Soup Processor (HSP3)
Stars: ✭ 120 (-17.24%)
Hadoop For GeoeventArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-96.55%)
Spark StatesCustom state store providers for Apache Spark
Stars: ✭ 83 (-42.76%)
couchdb-mangoMirror of Apache CouchDB Mango
Stars: ✭ 34 (-76.55%)
SqoopMirror of Apache Sqoop
Stars: ✭ 817 (+463.45%)
CmakCMAK is a tool for managing Apache Kafka clusters
Stars: ✭ 10,544 (+7171.72%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+184.83%)
Goodreads etl pipelineAn end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+446.9%)
parquet-usqlA custom extractor designed to read parquet for Azure Data Lake Analytics
Stars: ✭ 13 (-91.03%)
PanoptesA Global Scale Network Telemetry Ecosystem
Stars: ✭ 80 (-44.83%)
sparkApache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the kubernetes scheduler back-end is now on https://github.com/apache/spark/
Stars: ✭ 609 (+320%)
AspiaRemote desktop and file transfer tool.
Stars: ✭ 784 (+440.69%)
classifai🔥 One of the most comprehensive open-source data annotation platform.
Stars: ✭ 99 (-31.72%)
Rakam Api📈 Collect customer event data from your apps. (Note that this project only includes the API collector, not the visualization platform)
Stars: ✭ 772 (+432.41%)
spark-utilsBasic framework utilities to quickly start writing production ready Apache Spark applications
Stars: ✭ 25 (-82.76%)
IotdbApache IoTDB
Stars: ✭ 1,221 (+742.07%)
Opendata.cern.chSource code for the CERN Open Data portal
Stars: ✭ 411 (+183.45%)
Spark Movie LensAn on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+413.79%)
AsakusafwAsakusa Framework
Stars: ✭ 114 (-21.38%)
CythonThe most widely used Python to C compiler
Stars: ✭ 6,588 (+4443.45%)
WarpConvert and analyze large data sets at light speed, on Mac and iOS.
Stars: ✭ 62 (-57.24%)
Devops Python Tools80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+180%)
MLBDMaterials for "Machine Learning on Big Data" course
Stars: ✭ 20 (-86.21%)
SakuraSAKURA Editor (Japanese text editor for MS Windows)
Stars: ✭ 689 (+375.17%)
Big-Data-Demo基于Vue、three.js、echarts,数据可视化展示项目,包含三维模型导入交互、三维模型标注等功能
Stars: ✭ 146 (+0.69%)
Belajarpython.comOpen Source Indonesian Python Programming Tutorial Site
Stars: ✭ 141 (-2.76%)
Parquet.jlJulia implementation of Parquet columnar file format reader
Stars: ✭ 93 (-35.86%)
Fo DicomFellow Oak DICOM for .NET, .NET Core, Universal Windows, Android, iOS, Mono and Unity
Stars: ✭ 674 (+364.83%)
meetups-archivosPpts, códigos y videos de las meetups, data science days, videollamadas y workshops. Data Science Research es una organización sin fines de lucro que busca difundir, descentralizar y difundir los conocimientos en Ciencia de Datos e Inteligencia Artificial en el Perú, dando oportunidades a nuevos talentos mediante MeetUps, Workshops y Semilleros …
Stars: ✭ 60 (-58.62%)
Cogcomp NlpCogComp's Natural Language Processing libraries and Demos:
Stars: ✭ 410 (+182.76%)
NabhashAn extremely fast Non-crypto-safe AES Based Hash algorithm for Big Data
Stars: ✭ 62 (-57.24%)
MockneatMockNeat is a Java 8+ library that facilitates the generation of arbitrary data for your applications.
Stars: ✭ 410 (+182.76%)
Decentralized InternetA SDK/library for decentralized web and distributing computing projects
Stars: ✭ 406 (+180%)
PrigPrig is a lightweight framework for test indirections in .NET Framework.
Stars: ✭ 106 (-26.9%)
PetastormPetastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
Stars: ✭ 1,108 (+664.14%)