1. Datasetssource{d} datasets ("big code") for source code analysis and machine learning on source code
4. Go BillyThe missing interface filesystem abstraction for Go
5. GitbaseSQL interface to git repositories, written in Go. https://docs.sourced.tech/gitbase
6. LapjvLinear Assignmment Problem solver using Jonker-Volgenant algorithm - Python 3 native module.
9. LookoutAssisted code review, running custom code analyzers on pull requests
10. Mlsourced.ml is a library and command line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees
13. HerculesGaining advanced insights from Git repository history.
14. Borgesborges collects and stores Git repositories.
15. Gitbase Webgitbase web client; source{d} CE comes with a new UI, check it at https://docs.sourced.tech/community-edition/
16. VecinoVecino is a command line application to discover Git repositories which are similar to the one that the user provides.
18. Code2vecMLonCode community effort to implement Learning Distributed Representations of Code (https://arxiv.org/pdf/1803.09473.pdf)
19. Coreos NvidiaYet another NVIDIA driver container for Container Linux (aka CoreOS)
20. Go KallaxKallax is a PostgreSQL typesafe ORM for the Go language.
21. KmcudaLarge scale K-means and K-nn implementation on NVIDIA GPU / CUDA
24. Go GitProject has been moved to: https://github.com/go-git/go-git
25. EnryA faster file programming language detector
27. GuideAiming to be a fully transparent company. All information about source{d} and what it's like to work here.
29. okrsObjectives & Key Results repository for the source{d} team
31. apolloAdvanced similarity and duplicate source code proof of concept for our research efforts.
32. modelforgePython library to share machine learning models easily and reliably.
33. flamingoFlamingo is a very thin and simple platform-agnostic chat bot framework
34. tmscNo description, website, or topics provided.
35. jgit-spark-connectorjgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source code analysis.