22 open source projects by amplab

1. Spark Indexedrdd
An efficient updatable key-value store for Apache Spark
✭ 252
scala
2. Snap
Scalable Nucleotide Alignment Program -- a fast and accurate read aligner for high-throughput sequencing data
✭ 198
3. Datascience Sp14
Repository for data science course Spring 14
✭ 184
shell
4. Mli
An API for Distributed Machine Learning
✭ 155
scala
5. Training
Training materials for Strata, AMP Camp, etc
✭ 154
scala
6. Carat
Carat: Collaborative Energy Debugging
✭ 116
java
7. Drizzle Spark
Drizzle integration with Apache Spark
✭ 115
scala
9. Benchmark
Large scale query engine benchmark
✭ 97
python
10. Ampcrowd
A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.
✭ 51
python
11. Shark
Development of Shark has ended.
✭ 996
scala
12. Cyclades
Cyclades
✭ 27
13. Sparknet
Distributed Neural Networks for Spark
✭ 604
scala
14. Keystone
Simplifying robust end-to-end machine learning on Apache Spark.
✭ 472
scala
15. Spark Ec2
Scripts used to set up a Spark cluster on EC2
✭ 372
python
16. Graphx
Former GraphX development repository. GraphX has been merged into Apache Spark; please submit pull requests there.
✭ 333
scala
17. Docker Scripts
Dockerfiles and scripts for Spark and Shark Docker images
✭ 260
shell
18. Succinct
Enabling queries on compressed data.
19. ampcamp
scripts used for ampcamp
20. smash
Benchmarking toolkit for variant calling
21. training-scripts
Scripts to launch cluster used for Strata
22. ernest
Code for Ernest
✭ 28
python