All Git Users → ogrisel

15 open source projects by ogrisel

1. Pygbm
Experimental Gradient Boosting Machines in Python with numba.
✭ 165
python
2. Pignlproc
Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.
✭ 160
java
3. Python Appveyor Demo
Demo project for building Python wheels with appveyor.com
✭ 152
powershell
4. Parallel ml tutorial
Tutorial on scikit-learn and IPython for parallel machine learning
5. Docker Distributed
Experimental docker-compose setup to bootstrap distributed on a docker-swarm cluster.
✭ 93
shell
6. Spylearn
Repo for experiments on pyspark and sklearn
✭ 80
python
7. Paper2ebook
Utility to re-structure research papers published in US Letter or A4 format PDF files to typically remove the 2 columns layout.
✭ 53
java
8. Dbpediakit
Python utilities to do work with the DBpedia dumps for analytics.
✭ 38
python
9. Wheelhouse Uploader
Script to help maintain a wheelhouse folder on a cloud storage.
✭ 28
python
10. Notebooks
Some sample IPython notebooks for scikit-learn
11. oglearn
ogrisel's utility extensions for scikit-learn
✭ 24
python
12. corpusmaker
clojure utilities to build training corpora for machine learning / NLP out of public wikimedia dumps: status - partially stalled - will probably be reworked as cascalog scripts -- this project is in stalled mode right now: the pignlproc project is likely to replace it due to licensing constraints for future integration in Apache projects
13. mahout
Personal development repository to prepare contributions and patches for Apache Mahout
✭ 15
javaperl
14. text-mining-class
Introduction to web scraping and text mining
15. my-linux-devbox
Vagrant / Salt configuration with Ubuntu to work on projects related to the scipy stack under Python 3 and Python 2
✭ 26
scheme
1-15 of 15 user projects