fccoelho / curso-IRI

Licence: other
Introduction to Information Retrieval

Programming language: Jupyter Notebook

Information Retrieval Systems

Introduction to Information Retrieval Systems.

Topics

Topic 1: Introduction and Boolean Retrieval

Topic 2: Using Inverted Indexes

Topic 3: Tolerant Retrieval

Topic 4: Vector Space Model

Topic 5: Probabilistic Retrieval

Topic 6: Topic Modeling

Topic 7: Probabilistic Text Modeling

Topic 8: Language Models
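
As a rough illustration of what Topics 1 and 2 cover, the sketch below builds a tiny inverted index and answers a Boolean AND query with it. It is not taken from the course notebooks: the function names `build_inverted_index` and `boolean_and`, the whitespace tokenizer, and the three sample documents are all assumptions made for this example.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each lowercase token to the sorted ids of the documents containing it."""
    index = defaultdict(set)
    for doc_id, text in enumerate(docs):
        # Naive whitespace tokenization (an assumption; real systems normalize more).
        for token in text.lower().split():
            index[token].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}


def boolean_and(index, *terms):
    """Return the ids of documents that contain every query term."""
    postings = [set(index.get(term.lower(), [])) for term in terms]
    if not postings:
        return []
    return sorted(set.intersection(*postings))


# Hypothetical toy collection, used only for illustration.
docs = [
    "information retrieval with inverted indexes",
    "boolean retrieval model",
    "vector space model for retrieval",
]
index = build_inverted_index(docs)
print(boolean_and(index, "retrieval", "model"))  # [1, 2]
```

Storing postings as sorted lists mirrors the usual textbook presentation; the intersection here goes through Python sets for brevity rather than a linear merge of the sorted lists.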


Interesting corpora for use in the course

  1. WikiLeaks: cables "leaked" by WikiLeaks. See also the notebook.
  2. Dicionário Histórico e Biográfico Brasileiro (DHBB). See the notebook.
  3. Wikipedia. See this notebook.
  4. Brazilian blogs.
  5. TensorFlow corpora. See the list.

List of 2020 assignments

Document on HackMD


Reference book:


Software

IR:

NLP
