All Projects → mattdeitke → Cvpr2019

mattdeitke / Cvpr2019

Licence: mit
Displays all the 2019 CVPR Accepted Papers in a way that they are easy to parse.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Cvpr2019

Gifcurry
😎 The open-source, Haskell-built video editor for GIF makers.
Stars: ✭ 830 (+1176.92%)
Mutual labels:  imagemagick
Maman
Rust Web Crawler saving pages on Redis
Stars: ✭ 39 (-40%)
Mutual labels:  web-crawler
Jekyll Assets
🎨 Asset pipelines for Jekyll.
Stars: ✭ 1,083 (+1566.15%)
Mutual labels:  imagemagick
Siteshot
Website screenshot capturer written in Go.
Stars: ✭ 18 (-72.31%)
Mutual labels:  imagemagick
Doctopics
Various examples of topic modeling and other text analysis
Stars: ✭ 32 (-50.77%)
Mutual labels:  lda
Crawlab
Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Stars: ✭ 8,392 (+12810.77%)
Mutual labels:  web-crawler
Imgbot
An Azure Function solution to crawl through all of your image files in GitHub and losslessly compress them. This will make the file size go down, but leave the dimensions and quality untouched. Once it's done, ImgBot will open a pull request for you to review and merge. [email protected]
Stars: ✭ 732 (+1026.15%)
Mutual labels:  imagemagick
Multilingual Latent Dirichlet Allocation Lda
A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.
Stars: ✭ 64 (-1.54%)
Mutual labels:  lda
Patternrecognition matlab
Feature reduction projections and classifier models are learned by training dataset and applied to classify testing dataset. A few approaches of feature reduction have been compared in this paper: principle component analysis (PCA), linear discriminant analysis (LDA) and their kernel methods (KPCA,KLDA). Correspondingly, a few approaches of classification algorithm are implemented: Support Vector Machine (SVM), Gaussian Quadratic Maximum Likelihood and K-nearest neighbors (KNN) and Gaussian Mixture Model(GMM).
Stars: ✭ 33 (-49.23%)
Mutual labels:  lda
Wand
The ctypes-based simple ImageMagick binding for Python
Stars: ✭ 1,074 (+1552.31%)
Mutual labels:  imagemagick
Sv Images
Image manipulation library with an HTTP based API.
Stars: ✭ 7 (-89.23%)
Mutual labels:  imagemagick
Dutsso
快速登录大连理工大学统一身份认证系统(SSO)的Python模块,可轻松实现成绩提醒、抢课、玉兰卡信息、个人信息查询等功能。
Stars: ✭ 32 (-50.77%)
Mutual labels:  web-crawler
Social Media Depression Detector
😔 😞 😣 😖 😩 Detect depression on social media using the ssToT method introduced in our ASONAM 2017 paper titled "Semi-Supervised Approach to Monitoring Clinical Depressive Symptoms in Social Media"
Stars: ✭ 45 (-30.77%)
Mutual labels:  lda
Mockup
Perspective app screens and isometric mock-up tool. Automated high quality, high resolution, marketing art designer for mockups. Written in minimalist Bash using ImageMagick and Love! [Application, Art]
Stars: ✭ 18 (-72.31%)
Mutual labels:  imagemagick
Imager
Image processing proxy
Stars: ✭ 56 (-13.85%)
Mutual labels:  imagemagick
Flyimg
Dockerized PHP7 application runs as a Microservice to resize and crop images on the fly. Get optimised images with MozJPEG, WebP or PNG using ImageMagick. Includes face detection, cropping, face blurring, image rotation and many other options. Abstract storage based on FlySystem in order to store images on any provider (local, AWS S3...).
Stars: ✭ 762 (+1072.31%)
Mutual labels:  imagemagick
Mod dims
Apache HTTP dynamic image resizing module
Stars: ✭ 40 (-38.46%)
Mutual labels:  imagemagick
Abotx
Cross Platform C# Web crawler framework, headless browser, parallel crawler. Please star this project! +1.
Stars: ✭ 63 (-3.08%)
Mutual labels:  web-crawler
Terpene Profile Parser For Cannabis Strains
Parser and database to index the terpene profile of different strains of Cannabis from online databases
Stars: ✭ 63 (-3.08%)
Mutual labels:  web-crawler
Lightlda
fast sampling algorithm based on CGS
Stars: ✭ 49 (-24.62%)
Mutual labels:  lda

Updated repository at https://github.com/mattdeitke/CVPR-Accepted-Papers-Viewer


CVPR 2019 Accepted Papers

The main goal of these scripts is to build a page that displays the accepted papers for CVPR 2019 in a way that is easier for humans to parse (see: https://mattdeitke.com/CVPR-2019). Below is an example of what this repository will display, and following that is what CVPR open access currently shows.

In particular, there is functionality to cluster papers based on latent Dirichlet allocation topics, create thumbnail images from the first 8 pages of each PDF, find the abstracts, copy a BibTeX, view the paper and supplementary material, and more. The scripts use Python 3.7 and should work for any past and future CVPR conference (unless they change how they display the pages). Modifications can be made to adapt the format to another conference.

Installation

  1. Clone this repository git clone https://github.com/mattdeitke/CVPR2019

  2. Save the HTML from where the accepted papers are displayed. For CVPR, this year, that would be http://openaccess.thecvf.com/CVPR2019.py.

  3. Install ImageMagick, which can be done using sudo apt-get install imagemagick or using another supported method such as brew install imagemagick.

  4. Run pdftowordcloud.py to generate top words for each paper. The output is saved in topwords.p.

  5. Run pdftothumbs.py to generate tiny thumbnails for all papers. The outputs are saved in thumbs/ folder.

  6. Run scrape.py to generate each paperid, title, authors list by scraping the cvpr2019oar.html page.

  7. Run makecorpus.py to create allpapers.txt file that has all papers (one per row).

  8. Run python lda.py -f allpapers.txt -k 7 --alpha=0.5 --beta=0.5 -i 100 . This will generate a pickle file called ldaphi.p that contains the LDA word distribution matrix. Thanks to this nice LDA code by @shuyo! It requires nltk library and numpy. In this example we are using 7 categories. You would need to change the cvprnice_template.html file a bit if you wanted to try different number of categories.

  9. Generate the abstract files inside abstracts/ folder using getabstracts.py.

  10. Finally, run generatenicelda.py to create the index.html page.

Acknowledgements

Big thanks to @karpathy for his NeurIPS preview and ArXiV Sanity Preserver, which is what this repository builds on! Also a thanks to @tholman for creating a more modern GitHub Corners and @shuyo for the LDA code! Finally, more thanks go to the people at CVPR for openly publishing all of their accepted papers!

Licence

MIT License

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].