All Projects → Hnswlib → Similar Projects or Alternatives

939 Open source projects that are alternatives of or similar to Hnswlib

O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian

Stars: ✭ 34 (-68.52%)

Mutual labels: spark, pyspark

Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.

Stars: ✭ 2,323 (+2050.93%)

Mutual labels: spark, pyspark

Learningapachespark

LearningApacheSpark

Stars: ✭ 155 (+43.52%)

Mutual labels: spark, pyspark

Spark Iforest

Isolation Forest on Spark

Stars: ✭ 166 (+53.7%)

Mutual labels: spark, pyspark

basin

Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser

Stars: ✭ 25 (-76.85%)

Mutual labels: spark, pyspark

incubator-linkis

Stars: ✭ 2,459 (+2176.85%)

Mutual labels: spark, pyspark

Relation extraction

Relation Extraction using Deep learning(CNN)

Stars: ✭ 96 (-11.11%)

Mutual labels: spark, pyspark

Pyspark Learning

Updated repository

Stars: ✭ 147 (+36.11%)

Mutual labels: spark, pyspark

ODSC India 2018

My presentation at ODSC India 2018 about Deep Learning with Apache Spark

Stars: ✭ 26 (-75.93%)

Mutual labels: spark, pyspark

spark-extension

A library that provides useful extensions to Apache Spark and PySpark.

Stars: ✭ 25 (-76.85%)

Mutual labels: spark, pyspark

Spark With Python

Fundamentals of Spark with Python (using PySpark), code examples

Stars: ✭ 150 (+38.89%)

Mutual labels: spark, pyspark

Optimus

🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark

Stars: ✭ 986 (+812.96%)

Mutual labels: spark, pyspark

Cc Pyspark

Process Common Crawl data with Python and Spark

Stars: ✭ 147 (+36.11%)

Mutual labels: spark, pyspark

Spark Nlp

State of the Art Natural Language Processing

Stars: ✭ 2,518 (+2231.48%)

Mutual labels: spark, pyspark

Mmlspark

Simple and Distributed Machine Learning

Stars: ✭ 2,899 (+2584.26%)

Mutual labels: spark, pyspark

Pyspark Example Project

Example project implementing best practices for PySpark ETL jobs and applications.

Stars: ✭ 633 (+486.11%)

Mutual labels: spark, pyspark

Pyspark Cheatsheet

🐍 Quick reference guide to common patterns & functions in PySpark.

Stars: ✭ 108 (+0%)

Mutual labels: spark, pyspark

Spark Py Notebooks

Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks

Stars: ✭ 1,338 (+1138.89%)

Mutual labels: spark, pyspark

Sparkmagic

Jupyter magics and kernels for working with remote Spark clusters

Stars: ✭ 954 (+783.33%)

Mutual labels: spark, pyspark

Repository

个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。

Stars: ✭ 92 (-14.81%)

Mutual labels: algorithm, spark

Gimel

Big Data Processing Framework - Unified Data API or SQL on Any Storage

Stars: ✭ 216 (+100%)

Mutual labels: spark, pyspark

aut

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

Stars: ✭ 111 (+2.78%)

Mutual labels: spark, pyspark

data processing course

Some class materials for a data processing course using PySpark

Stars: ✭ 50 (-53.7%)

Mutual labels: spark, pyspark

Sparkling Titanic

Training models with Apache Spark, PySpark for Titanic Kaggle competition

Stars: ✭ 12 (-88.89%)

Mutual labels: spark, pyspark

Live log analyzer spark

Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.

Stars: ✭ 14 (-87.04%)

Mutual labels: spark, pyspark

Spark Tdd Example

A simple Spark TDD example

Stars: ✭ 23 (-78.7%)

Mutual labels: spark, pyspark

W2v

Word2Vec models with Twitter data using Spark. Blog:

Stars: ✭ 64 (-40.74%)

Mutual labels: spark, pyspark

Eat pyspark in 10 days

pyspark🍒🥭 is delicious，just eat it!😋😋

Stars: ✭ 116 (+7.41%)

Mutual labels: spark, pyspark

Handyspark

HandySpark - bringing pandas-like capabilities to Spark dataframes

Stars: ✭ 158 (+46.3%)

Mutual labels: spark, pyspark

Scriptis

Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.

Stars: ✭ 696 (+544.44%)

Mutual labels: spark, pyspark

Java learning practice

java 进阶之路：面试高频算法、akka、多线程、NIO、Netty、SpringBoot、Spark&&Flink 等

Stars: ✭ 110 (+1.85%)

Mutual labels: algorithm, spark

Spark Practice

Apache Spark (PySpark) Practice on Real Data

Stars: ✭ 200 (+85.19%)

Mutual labels: spark, pyspark

kafka-compose

🎼 Docker compose files for various kafka stacks

Stars: ✭ 32 (-70.37%)

Mutual labels: spark, pyspark

Azure Cosmosdb Spark

Apache Spark Connector for Azure Cosmos DB

Stars: ✭ 165 (+52.78%)

Mutual labels: spark, pyspark

Devops Python Tools

80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.

Stars: ✭ 406 (+275.93%)

Mutual labels: spark, pyspark

Pysparkgeoanalysis

🌐 Interactive Workshop on GeoAnalysis using PySpark

Stars: ✭ 63 (-41.67%)

Mutual labels: spark, pyspark

Spark python ml examples

Spark 2.0 Python Machine Learning examples

Stars: ✭ 87 (-19.44%)

Mutual labels: spark, pyspark

Earcut

The fastest and smallest JavaScript polygon triangulation library for your WebGL apps

Stars: ✭ 1,359 (+1158.33%)

Mutual labels: algorithm

Frontend knowledge

📚 Important Frontend Knowledge（前端知识汇总）

Stars: ✭ 103 (-4.63%)

Mutual labels: algorithm

Advisor

Open-source implementation of Google Vizier for hyper parameters tuning

Stars: ✭ 1,359 (+1158.33%)

Mutual labels: algorithm

Fracture

generative algorithm

Stars: ✭ 99 (-8.33%)

Mutual labels: algorithm

Go Algorithms

Algorithms and data structures for golang

Stars: ✭ 1,529 (+1315.74%)

Mutual labels: algorithm

Quadsort

Quadsort is a stable adaptive merge sort which is faster than quicksort.

Stars: ✭ 1,385 (+1182.41%)

Mutual labels: algorithm

Algorithms

Algorithms and data structures implemented in JavaScript with explanations, for further readings

Stars: ✭ 99 (-8.33%)

Mutual labels: algorithm

Almond

A Scala kernel for Jupyter

Stars: ✭ 1,354 (+1153.7%)

Mutual labels: spark

Fast methods

N-Dimensional Fast Methods: Fast Marching, Fast Sweeping, Group Marching, Fast Iterative, etc.

Stars: ✭ 102 (-5.56%)

Mutual labels: algorithm

Pyspark Stubs

Apache (Py)Spark type annotations (stub files).

Stars: ✭ 98 (-9.26%)

Mutual labels: pyspark

Deep Reinforcement Learning With Pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Stars: ✭ 1,345 (+1145.37%)

Mutual labels: algorithm

Any Angle Pathfinding

A collection of algorithms used for any-angle pathfinding with visualisations.

Stars: ✭ 107 (-0.93%)

Mutual labels: algorithm

Codelibrary

💎Collection of algorithms and data structures

Stars: ✭ 1,585 (+1367.59%)

Mutual labels: algorithm

Acm Icpc Preparation

ACM-ICPC Preparation Guide

Stars: ✭ 1,377 (+1175%)

Mutual labels: algorithm

Mystl

C++11 实现的简易版 STL

Stars: ✭ 97 (-10.19%)

Mutual labels: algorithm

Logisland

Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.

Stars: ✭ 97 (-10.19%)

Mutual labels: spark

Spark Terasort

Stars: ✭ 101 (-6.48%)

Mutual labels: spark

Schemer

Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.

Stars: ✭ 97 (-10.19%)

Mutual labels: spark

Delaunator

An incredibly fast JavaScript library for Delaunay triangulation of 2D points

Stars: ✭ 1,641 (+1419.44%)