All Projects → aserg-ufmg → CSIndex

aserg-ufmg / CSIndex

Licence: MIT license
Transparent data about Brazilian scientific production in Computer Science

Programming Languages

HTML
75241 projects
javascript
184084 projects - #8 most used programming language
python
139335 projects - #7 most used programming language
CSS
56736 projects

Projects that are alternatives of or similar to CSIndex

Werdlists
⌨️ Wordlists, Dictionaries and Other Data Sets for Writing Software Security Test Cases
Stars: ✭ 216 (+535.29%)
Mutual labels:  computer-science
Javascript Algorithms
📝 Algorithms and data structures implemented in JavaScript with explanations and links to further readings
Stars: ✭ 133,406 (+392270.59%)
Mutual labels:  computer-science
curriculum
A roadmap for Boot.dev's CS curriculum for backend developers
Stars: ✭ 492 (+1347.06%)
Mutual labels:  computer-science
Computer Science In Javascript
Computer science reimplemented in JavaScript
Stars: ✭ 2,590 (+7517.65%)
Mutual labels:  computer-science
Computer Science
🎓 Path to a free self-taught education in Computer Science!
Stars: ✭ 103,613 (+304644.12%)
Mutual labels:  computer-science
Developer Roadmap
Roadmap to becoming a developer in 2021
Stars: ✭ 180,811 (+531697.06%)
Mutual labels:  computer-science
Open Source Cs Python
Video discussing this curriculum:
Stars: ✭ 2,859 (+8308.82%)
Mutual labels:  computer-science
CS-Learning-Resources
Learning Resources for Those New to Computer Science
Stars: ✭ 19 (-44.12%)
Mutual labels:  computer-science
Fucking Algorithm
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
Stars: ✭ 99,705 (+293150%)
Mutual labels:  computer-science
algorithms
A open source repository of different kinds of algorithms in c. Newbies are encouraged to contribute! Note: I made this code when i didn't have as much experience in programming
Stars: ✭ 33 (-2.94%)
Mutual labels:  computer-science
Cs Roadmap
My Computer Science Curriculum
Stars: ✭ 239 (+602.94%)
Mutual labels:  computer-science
Professional Programming
A collection of full-stack resources for programmers.
Stars: ✭ 15,936 (+46770.59%)
Mutual labels:  computer-science
Coding Interview University
A complete computer science study plan to become a software engineer.
Stars: ✭ 204,859 (+602426.47%)
Mutual labels:  computer-science
.net Big O Algorithm Complexity Cheat Sheet
Big-O complexities of common algorithms used in .NET and Computer Science.
Stars: ✭ 215 (+532.35%)
Mutual labels:  computer-science
fit3143-notes
Summary notes for my "Parallel Computing" class
Stars: ✭ 35 (+2.94%)
Mutual labels:  computer-science
Mitx 6.00.1x
Coding exercises and Problem Sets for MITx: 6.00.1x Introduction to Computer Science and Programming Using Python, EdX, Aug-Nov 2016
Stars: ✭ 213 (+526.47%)
Mutual labels:  computer-science
Cs Notes
📚 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计
Stars: ✭ 143,620 (+422311.76%)
Mutual labels:  computer-science
DataStructures-Algorithms
A collections of many CP-based or DSA-based Questions that is stored various algorithms and datastructures to increase coding aptitutde. Anybody with a knack for coding can feel free to add more solutions and questions in the respective folders
Stars: ✭ 15 (-55.88%)
Mutual labels:  computer-science
wtm-udacity-scholars-nanodegree-resources
A List of Resources for Udacity Nanodegrees
Stars: ✭ 15 (-55.88%)
Mutual labels:  computer-science
TheJobInterviewGuide
A job guide to help developers get through interviews and get amazing jobs!
Stars: ✭ 267 (+685.29%)
Mutual labels:  computer-science

CSIndexbr

CSIndexbr (https://csindexbr.org) provides transparent data about Brazilian scientific production in Computer Science. We index full research papers published in selected conferences and journals. The papers are retrieved from DBLP.

Dependencies

CSIndexbr is implemented in Python 3.9 (backend scripts). The front-end uses HTML and pure Javascript.

We also use:

  • requests: "a simple, yet elegant HTTP library", which is used to retrieve data from DBLP.
  • xmltodict: "a Python module that makes working with XML feel like you are working with JSON", which is used to parse the XML files returned by DBLP.

Scripts

All these script must be called from "data" folder:

  • ./run se pl chi: update the papers (and related data) for the listed research areas (se, pl, and chi, in the example).

  • ./runall: update the papers (and related data) for all research areas

  • ./rundblp: download dblp files (xml, with papers) for all tracked professors

  • ./runcitations: update citations for for all research areas

Input files

** These files must be placed in the "data" folder: **

There are two "global" configuration files:

  • all-researchers.csv: Brazilian CS professors (i.e., from CS departments) whose papers are tracked by CSIndexber, with three columns:

    • Professor name (do not use "-" or accents in names)
    • University (do not use distinct names for the same university; e.g. PUC-Rio and PUC-RIO)
    • DBLP PID (see in this screenshot how to retrieve PIDs from DBLP profiles)
  • research-areas-config.csv: research areas covered by CSIndexbr, with two columns:

    • research area acronym (e.g., se)
    • minimum size of the conference papers indexed in this area (e.g., 10).

The following files are specific of a given research area (i.e., each area has all files listed next; although, in this list, we are using "se" as example):

  • se-confs.csv: conferences and journals indexed in a given research area ("se", in this case), with three columns:

    • venue name at DBLP:

      • for conferences, use "booktitle" XML entry, see example;
      • for journals, use "journal" XML entry; see example
    • venue name in the charts and tables generated by CSIndexbr

    • venue type, as follows:

      • 1: top-conference
      • 2: not used anynore
      • 3: "regular" conference (i.e., non-top)
      • 4: top-journal
      • 5: "regular" journal (i.e., non-top)
      • 6: magazine or journal that accept short papers (>= 6 pages)
      • 7: journals with low normalized-h5-index (see FAQ, for details)
  • se-black-list.txt: list of papers that must not be indexed, although they attend the basic indexing criteria. For example, they are papers published in other tracks, that is not the main research track of a conference. Each line contains the "url" XML field of the paper (see example)

  • se-white-list.txt: list of papers that must be indexed. For example, papers that do not have page numbers at DBLP metadata (see example)

Output files

** These files are generated in the "data" folder: **

Examples assuming "se" research area:

  • se-out-confs.csv: number of papers in indexed conferences
  • se-out-journals.csv: number of papers in indexed journals
  • se-out-profs-list.csv: professores with indexed papers in the area (and their departments)
  • se-out-profs.csv: number of professores with indexeded papers (in the area) per department
  • se-out-scores.csv: department scores (see formula in the FAQ)
  • se-out-papers.csv: metadata about indexed papers: year, venue, title, deparments, authors, doi, top or null (otherwise), journal (J) or conference (C), arxiv url or no_arxiv (otherwise), and number of citations

License:

MIT (for the source code) and CC BY-NC-SA 4.0 (for the data).

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].