
rohitthapliyal2000 / codechef-rank-comparator

License: MIT
A web application, hosted on the Heroku cloud platform, that scrapes CodeChef using Python and the lxml library with XPath (XML Path Language).

Programming Languages

Python, HTML, CSS

Projects that are alternatives to or similar to codechef-rank-comparator

restaurant-finder-featureReviews
Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).
Stars: ✭ 21 (-8.7%)
Mutual labels:  web-scraping, flask-application
selectorlib
A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them
Stars: ✭ 53 (+130.43%)
Mutual labels:  web-scraping, xpath
reapr
🕸→ℹ️ Reap Information from Websites
Stars: ✭ 14 (-39.13%)
Mutual labels:  web-scraping, xpath
playground
A Streamlit application to play with machine learning models directly from the browser
Stars: ✭ 48 (+108.7%)
Mutual labels:  heroku-deployment
scraping-ebay
Scraping Ebay's products using Scrapy Web Crawling Framework
Stars: ✭ 79 (+243.48%)
Mutual labels:  web-scraping
top-github-scraper
Scrape top GitHub repositories and users based on keywords
Stars: ✭ 40 (+73.91%)
Mutual labels:  web-scraping
GSoC-Data-Analyser
Simple search for organisations participating/participated in the GSoC
Stars: ✭ 29 (+26.09%)
Mutual labels:  web-scraping
India-WhatsAppFakeNews-Dataset
News articles on WhatsApp-related deaths, along with other articles from across India during that period
Stars: ✭ 41 (+78.26%)
Mutual labels:  web-scraping
Z-Spider
Some tips and case studies for web crawler development
Stars: ✭ 33 (+43.48%)
Mutual labels:  xpath
Springboard-Data-Science-Immersive
No description or website provided.
Stars: ✭ 52 (+126.09%)
Mutual labels:  web-scraping
terrain generator
A wizard that generates terrains for Gazebo using height maps.
Stars: ✭ 46 (+100%)
Mutual labels:  lxml
text-mining-corona-articles
Text Mining for Indonesian Online News Articles About Corona
Stars: ✭ 15 (-34.78%)
Mutual labels:  web-scraping
DAM
Syllabus and exercises for Multiplatform Application Development (Desarrollo de Aplicaciones Multiplataforma, DAM)
Stars: ✭ 96 (+317.39%)
Mutual labels:  xpath
heroshi
Heroshi – open source web crawler.
Stars: ✭ 51 (+121.74%)
Mutual labels:  web-scraping
DouBanReptile
Multithreaded crawler for Douban rental groups. After crawling, it automatically generates a Markdown file sorted by time.
Stars: ✭ 31 (+34.78%)
Mutual labels:  xpath
actor-scraper
House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.
Stars: ✭ 83 (+260.87%)
Mutual labels:  web-scraping
MLH-Quizzet
This is a smart Quiz Generator that generates a dynamic quiz from any uploaded text/PDF document using NLP. This can be used for self-analysis, question paper generation, and evaluation, thus reducing human effort.
Stars: ✭ 23 (+0%)
Mutual labels:  flask-application
img ai app boilerplate
An image classification app boilerplate to serve your deep learning models asap!
Stars: ✭ 27 (+17.39%)
Mutual labels:  heroku-deployment
tableau-scraping
Tableau scraper python library. R and Python scripts to scrape data from Tableau viz
Stars: ✭ 91 (+295.65%)
Mutual labels:  web-scraping
investigation-amazon-brands
Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Takes the Buy Box, it Doesn’t Give it up"
Stars: ✭ 56 (+143.48%)
Mutual labels:  web-scraping

codechef-rank-comparator

Web scraping in Python using the lxml package and XPath (XML Path Language)

Input :

Two CodeChef usernames to compare

Operations :

Checks whether each entered username is valid

Scrapes the name of each user

Scrapes the current CodeChef rating of each user

Outputs the list of contests in which both users participated, and the winner
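
The validation and scraping operations above can be sketched with lxml and XPath as follows. This is only an illustration, not the project's actual code: the sample markup, the XPath expressions, and the helper name parse_profile are assumptions, since the real CodeChef page structure is not described here. A page that lacks the expected elements is treated as an invalid username.

```python
from lxml import html

# Hypothetical, simplified profile markup; the real CodeChef page differs.
SAMPLE_PAGE = """
<html><body>
  <div class="user-details"><h1 class="name">Ada Lovelace</h1></div>
  <div class="rating-number">1875</div>
</body></html>
"""


def parse_profile(page_source):
    """Return (name, rating) from a profile page, or None if the page
    lacks the expected elements (treated here as an invalid username)."""
    tree = html.fromstring(page_source)
    # xpath() returns a list of the matching text nodes.
    names = tree.xpath('//h1[@class="name"]/text()')
    ratings = tree.xpath('//div[@class="rating-number"]/text()')
    if not names or not ratings:
        return None
    return names[0].strip(), int(ratings[0].strip())


print(parse_profile(SAMPLE_PAGE))
print(parse_profile("<html><body><p>Not found</p></body></html>"))
```

In a real run, the page source would come from an HTTP response (for example, the text of a requests response) rather than from an inline string.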

Packages/tools used :

lxml library, to use its HTML element API

requests library, to send HTTP requests to the webpage

sys module, to exit the program in case of errors

Flask web framework

HTML and JavaScript, to create the web template

Running the source code locally :

Clone/Download the repo

Run the application file, app.py, with the following command:

$ python app.py

Enter the following URL in the address bar of your browser:

localhost:5000

Requirements to run the source :

Python 3

pip (the Python package installer) :

$ sudo apt-get install python3-pip

requests package :

$ pip3 install requests

lxml package :

$ sudo apt-get install libxml2-dev libxslt1-dev python3-dev

$ pip3 install lxml

Flask package :

$ pip3 install flask

Description :

The script sends a request to the URL 'codechef.com/users/handle', where handle is the username; this part is handled by Python. XPath (XML Path Language) is used to extract information from the returned page. Separate files are specified for the information being scraped. A dictionary [contest -> rank] is created for each of the two users. The contests common to both dictionaries become part of the result, which is stored in a string 'result'.
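
The dictionary comparison described above might look like the sketch below. The function name compare_users, the sample data, and the output format are hypothetical. Two further assumptions: a lower rank number counts as the better result, and the overall winner is whoever did better in more of the mutual contests. The project may decide the winner differently.

```python
def compare_users(ranks_a, ranks_b, name_a, name_b):
    """Build a result string for the contests both users took part in.

    ranks_a and ranks_b map contest code -> rank. A lower rank number is
    assumed to be the better result.
    """
    lines = []
    wins_a = wins_b = 0
    # Keys present in both dictionaries are the mutual contests.
    for contest in sorted(ranks_a.keys() & ranks_b.keys()):
        rank_a, rank_b = ranks_a[contest], ranks_b[contest]
        if rank_a < rank_b:
            wins_a += 1
            better = name_a
        elif rank_b < rank_a:
            wins_b += 1
            better = name_b
        else:
            better = "tie"
        lines.append(f"{contest}: {rank_a} vs {rank_b} -> {better}")
    if wins_a == wins_b:
        overall = "tie"
    else:
        overall = name_a if wins_a > wins_b else name_b
    lines.append(f"Winner: {overall}")
    return "\n".join(lines)


# Made-up sample data: contest code -> rank.
alice = {"JAN18": 120, "FEB18": 200, "MARCH18": 45}
bob = {"JAN18": 100, "FEB18": 250, "MARCH18": 90, "APRIL18": 10}
result = compare_users(alice, bob, "alice", "bob")
print(result)
```

APRIL18 appears only in the second dictionary, so it is left out of the result.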

The Flask web framework is used to create a web API that links the HTML file with the Python script.
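
A minimal sketch of how Flask can link an HTML form to a Python function is shown below. The route names, the form field names, and the compare_handles placeholder are all assumptions made for illustration. The project's own app.py and template may be organised differently.

```python
from flask import Flask, request

app = Flask(__name__)


def compare_handles(first, second):
    # Hypothetical stand-in for the scraping and comparison logic.
    return f"Compared {first} with {second}"


# Inline form used here for brevity; a real app would likely use a template file.
FORM = """
<form method="post" action="/compare">
  <input name="first" placeholder="First username">
  <input name="second" placeholder="Second username">
  <button type="submit">Compare</button>
</form>
"""


@app.route("/")
def index():
    # Serve the input form.
    return FORM


@app.route("/compare", methods=["POST"])
def compare():
    # Read the two usernames submitted by the form.
    first = request.form.get("first", "").strip()
    second = request.form.get("second", "").strip()
    if not first or not second:
        return "Both usernames are required.", 400
    return compare_handles(first, second)
```

Adding a call to app.run() under an `if __name__ == "__main__":` guard would let `python app.py` serve this on port 5000, Flask's default.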

The UI is created in HTML. The application is then deployed on the Heroku cloud platform.

The application can be run from the terminal on localhost or accessed directly through the Heroku platform.

This project is a joint contribution of :

Rohit Thapliyal and Neeraj Negi
