All Projects → victorqribeiro → Hntitlenator

victorqribeiro / Hntitlenator

Licence: mit
Test your HN title against a neural network

Programming Languages

javascript
184084 projects - #8 most used programming language

Projects that are alternatives of or similar to Hntitlenator

Coursera Natural Language Processing Specialization
Programming assignments from all courses in the Coursera Natural Language Processing Specialization offered by deeplearning.ai.
Stars: ✭ 39 (-78.8%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Codesearchnet
Datasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (+648.91%)
Mutual labels:  natural-language-processing, nlp-machine-learning
How To Mine Newsfeed Data And Extract Interactive Insights In Python
A practical guide to topic mining and interactive visualizations
Stars: ✭ 61 (-66.85%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Nlp profiler
A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Stars: ✭ 181 (-1.63%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Nlp Pretrained Model
A collection of Natural language processing pre-trained models.
Stars: ✭ 122 (-33.7%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Nlp Conference Compendium
Compendium of the resources available from top NLP conferences.
Stars: ✭ 349 (+89.67%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Lda Topic Modeling
A PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (-50.54%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Character Based Cnn
Implementation of character based convolutional neural network
Stars: ✭ 205 (+11.41%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Lingo
package lingo provides the data structures and algorithms required for natural language processing
Stars: ✭ 113 (-38.59%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Textaugmentation Gpt2
Fine-tuned pre-trained GPT2 for custom topic specific text generation. Such system can be used for Text Augmentation.
Stars: ✭ 104 (-43.48%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Lingua
👄 The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Stars: ✭ 341 (+85.33%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Natural Language Processing Specialization
This repo contains my coursework, assignments, and Slides for Natural Language Processing Specialization by deeplearning.ai on Coursera
Stars: ✭ 151 (-17.93%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Ner
Named Entity Recognition
Stars: ✭ 288 (+56.52%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Text mining resources
Resources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (+94.57%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Machine Learning Resources
A curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Stars: ✭ 226 (+22.83%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Intent classifier
Stars: ✭ 67 (-63.59%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Repo 2016
R, Python and Mathematica Codes in Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
Stars: ✭ 103 (-44.02%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Hands On Natural Language Processing With Python
This repository is for my students of Udemy. You can find all lecture codes along with mentioned files for reading in here. So, feel free to clone it and if you have any problem just raise a question.
Stars: ✭ 146 (-20.65%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Pytorch Question Answering
Important paper implementations for Question Answering using PyTorch
Stars: ✭ 154 (-16.3%)
Mutual labels:  natural-language-processing, nlp-machine-learning
Web Database Analytics
Web scrapping and related analytics using Python tools
Stars: ✭ 175 (-4.89%)
Mutual labels:  natural-language-processing

HN Titlenator

A neural network to predict whether your HN post will get up votes by the title.

screenshot

Test your title here

About

A project about neural networks and NLP. Can a neural network predict how many up votes your HN post will have?

Motivation

Ever since I joined the hackernews community I've been wondering how one could get more attention when sharing a story. One of the things I noticed is the timing. The day of the week and the hour of the day you post your story seems to affect how many up votes your story will get.

In order to check that, I got 1256 stories from HN API. I then took the mean of the score of those stories and found a value of 70 up votes. Then I plotted a graph showing the day of the week and hour of the day those stories, who got more than 70 up votes, were posted.

plot

As you can see, Friday noon (UTC-3 Brasilia) seems to be the best day to post your story, since 18 stories posted that time had more than 70 upvotes (keep in mind that I only had access to 1256 stories, it's a very small sample compared to all the post HN must have every day).

Is that all? Are the time of the day and day of the week the ones responsible to get upvotes? Well, I decided to train a neural network with the words used on the titles and classify the title as good if it has more than 70 up votes and bad if it doesn't.

Neural Network

In order to train the neural network, I counted how many words were used in each title. The longest title had 17 words and the mean of all titles were 9 words. So, I model my neural network to receive 20 words as input. Titles with less then 20 words on them were padded with zeros.

I then turn each word into a value with the help of a dictionary.

I then created a simple web app where people could input their title and see how the neural network classifies it.

Limitations

This project is far from credible. All the things I did were to satisfy my own curiosity. With that being said, the bigger limitation I can see is that I only had access to a few stories. I also cannot validated the neural network prediction, cause in order for me to do that, I would have to write a content, come up with a title and then post it choosing words that triggers a good value on the neural network and post that history on a Friday noon, to see if my story succeed.

Source code

I tried to keep the code as clean as possible, but keep in mind that I did the whole experiment in one night. I tried to separate each step in it's own python script: get_data.py, proccess_data.py, train.py, retrain.py, plot.py


donate

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].