CodesearchnetDatasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (+800.65%)
ake-datasetsLarge, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.
Stars: ✭ 125 (-18.3%)
Machine Learning ResourcesA curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Stars: ✭ 226 (+47.71%)
Dl TextText pre-processing library for deep learning (Keras, tensorflow).
Stars: ✭ 119 (-22.22%)
Wb srgbWhite balance camera-rendered sRGB images (CVPR 2019) [Matlab & Python]
Stars: ✭ 101 (-33.99%)
Question GenerationGiven a sentence automatically generate reading comprehension style factual questions from that sentence, such that the sentence contains answers to those questions.
Stars: ✭ 100 (-34.64%)
Wiki SplitOne million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.
Stars: ✭ 95 (-37.91%)
Remo Python🐰 Python lib for remo - the app for annotations and images management in Computer Vision
Stars: ✭ 138 (-9.8%)
AestheticsImage Aesthetics Toolkit - includes Fisher Vector implementation, AVA (Image Aesthetic Visual Analysis) dataset and fast multi-threaded downloader
Stars: ✭ 113 (-26.14%)
DatascienceIt consists of examples, assignments discussed in data science course taken at algorithmica.
Stars: ✭ 92 (-39.87%)
Textaugmentation Gpt2Fine-tuned pre-trained GPT2 for custom topic specific text generation. Such system can be used for Text Augmentation.
Stars: ✭ 104 (-32.03%)
PipedreamConnect APIs, remarkably fast. Free for developers.
Stars: ✭ 2,068 (+1251.63%)
Mrc book《机器阅读理解:算法与实践》代码
Stars: ✭ 102 (-33.33%)
Onnxt5Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.
Stars: ✭ 143 (-6.54%)
Doppelganger[IMC 2020 (Best Paper Finalist)] Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions
Stars: ✭ 97 (-36.6%)
Bird Recognition ReviewA list of useful resources in the bird sound (song and calls) recognition, such as datasets, papers, links to open source projects and competitions
Stars: ✭ 116 (-24.18%)
Zzz Retired opensttRETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (-4.58%)
CrossweighCrossWeigh: Training Named Entity Tagger from Imperfect Annotations
Stars: ✭ 91 (-40.52%)
DareblopyData Reading Blocks for Python
Stars: ✭ 82 (-46.41%)
Seq2seq tutorialCode For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
Stars: ✭ 132 (-13.73%)
CholeraR Package for Analyzing John Snow's 1854 Cholera Map
Stars: ✭ 110 (-28.1%)
Gopup数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
Stars: ✭ 1,229 (+703.27%)
Nlp Pretrained ModelA collection of Natural language processing pre-trained models.
Stars: ✭ 122 (-20.26%)
Repo 2016R, Python and Mathematica Codes in Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
Stars: ✭ 103 (-32.68%)
Pix2codepix2code: Generating Code from a Graphical User Interface Screenshot
Stars: ✭ 11,349 (+7317.65%)
Multi object datasetsMulti-object image datasets with ground-truth segmentation masks and generative factors.
Stars: ✭ 121 (-20.92%)
Transitland DatastoreTransitland's centralized web service API for both querying and editing aggregated transit data from around the world
Stars: ✭ 101 (-33.99%)
PinsPin, Discover and Share Resources
Stars: ✭ 149 (-2.61%)
Exposure correctionReference code for the paper "Learning Multi-Scale Photo Exposure Correction", CVPR 2021.
Stars: ✭ 98 (-35.95%)
G Reader2018年机器阅读理解技术竞赛模型,国内外1000多支队伍中BLEU-4评分排名第6, ROUGE-L评分排名第14。(未ensemble,未嵌入训练好的词向量,无dropout)
Stars: ✭ 117 (-23.53%)
Monkeylearn⛔️ ARCHIVED ⛔️ 🐒 R package for text analysis with Monkeylearn 🐒
Stars: ✭ 95 (-37.91%)
LazyLazy, AI chatbot service.
Stars: ✭ 141 (-7.84%)
Persian Swear Wordsدیتاست کلمات نامناسب و بد فارسی برای فیلتر کردن متن ها
Stars: ✭ 95 (-37.91%)
IdenprofIdenProf dataset is a collection of images of identifiable professionals. It is been collected to enable the development of AI systems that can serve by identifying people and the nature of their job by simply looking at an image, just like humans can do.
Stars: ✭ 149 (-2.61%)
Doc2vec📓 Long(er) text representation and classification using Doc2Vec embeddings
Stars: ✭ 92 (-39.87%)
Lingopackage lingo provides the data structures and algorithms required for natural language processing
Stars: ✭ 113 (-26.14%)
Lda Topic ModelingA PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (-40.52%)
SummarusModels for automatic abstractive summarization
Stars: ✭ 83 (-45.75%)
FirstcoursenetworkscienceTutorials, datasets, and other material associated with textbook "A First Course in Network Science" by Menczer, Fortunato & Davis
Stars: ✭ 111 (-27.45%)
Openml RR package to interface with OpenML
Stars: ✭ 81 (-47.06%)
Hands On Natural Language Processing With PythonThis repository is for my students of Udemy. You can find all lecture codes along with mentioned files for reading in here. So, feel free to clone it and if you have any problem just raise a question.
Stars: ✭ 146 (-4.58%)
Atis datasetThe ATIS (Airline Travel Information System) Dataset
Stars: ✭ 81 (-47.06%)
AtnreAdversarial Training for Neural Relation Extraction
Stars: ✭ 108 (-29.41%)
Russian news corpusRussian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ
Stars: ✭ 76 (-50.33%)
Nlp Paper自然语言处理领域下的对话语音领域,整理相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
Stars: ✭ 67 (-56.21%)
ChineseglueLanguage Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard
Stars: ✭ 1,548 (+911.76%)