All Projects → inab → Biolitmap

inab / Biolitmap

Licence: lgpl-2.1
Code for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Biolitmap

Awesome Datascience
📝 An awesome Data Science repository to learn and apply for real world problems.
Stars: ✭ 17,520 (+97233.33%)
Mutual labels:  data-science, data-mining, science, data-visualization
Book Socialmediaminingpython
Companion code for the book "Mastering Social Media Mining with Python"
Stars: ✭ 462 (+2466.67%)
Mutual labels:  data-science, data-mining, natural-language-processing, data-visualization
Data Science Toolkit
Collection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (+838.89%)
Mutual labels:  data-science, data-mining, natural-language-processing, data-visualization
Cookbook 2nd Code
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (+2905.56%)
Mutual labels:  data-science, data-mining, data-visualization
Data Science With Ruby
Practical Data Science with Ruby based tools.
Stars: ✭ 549 (+2950%)
Mutual labels:  data-science, data-mining, data-visualization
Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (+850%)
Mutual labels:  data-science, data-mining, data-visualization
Fantasy Basketball
Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.
Stars: ✭ 146 (+711.11%)
Mutual labels:  data-science, data-mining, data-visualization
Datascience
Curated list of Python resources for data science.
Stars: ✭ 3,051 (+16850%)
Mutual labels:  data-science, data-mining, data-visualization
Amazing Feature Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (+1111.11%)
Mutual labels:  data-science, data-mining, data-visualization
Data Mining Conferences
Ranking, acceptance rate, deadline, and publication tips
Stars: ✭ 236 (+1211.11%)
Mutual labels:  data-science, data-mining, research
Pydataroad
open source for wechat-official-account (ID: PyDataLab)
Stars: ✭ 302 (+1577.78%)
Mutual labels:  data-science, data-mining, data-visualization
Cookbook 2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+3811.11%)
Mutual labels:  data-science, data-mining, data-visualization
Pzad
Курс "Прикладные задачи анализа данных" (ВМК, МГУ имени М.В. Ломоносова)
Stars: ✭ 160 (+788.89%)
Mutual labels:  data-science, data-mining, data-visualization
Gwu data mining
Materials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+1105.56%)
Mutual labels:  data-science, data-mining, data-visualization
Gensim
Topic Modelling for Humans
Stars: ✭ 12,763 (+70805.56%)
Mutual labels:  data-science, data-mining, natural-language-processing
Deepgraph
Analyze Data with Pandas-based Networks. Documentation:
Stars: ✭ 232 (+1188.89%)
Mutual labels:  data-science, data-mining, data-visualization
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+8322.22%)
Mutual labels:  data-science, natural-language-processing, data-visualization
Awesome Scientific Python
A curated list of awesome scientific Python resources
Stars: ✭ 127 (+605.56%)
Mutual labels:  data-science, science, data-visualization
Orange3
🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+17411.11%)
Mutual labels:  data-science, data-mining, data-visualization
Data Science
Collection of useful data science topics along with code and articles
Stars: ✭ 315 (+1650%)
Mutual labels:  data-science, natural-language-processing, data-visualization

BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications

Description

Code for the web-based geolocated and temporal visualization of bioinformatics research (BIOLITMAP): http://socialanalytics.bsc.es/biolitmap/.

Paper accepted to Oxford Bioinformatics.

Directory structure

  • In /data the export of the BIOLITMAP SQL database is stored, as it was in December 2017.
  • In /scripts the tools used for the NLP tasks are stored
  • /scripts/src stores the codes related to the NLP, Clustering, Topic Modeling and Perplexity Analysis tasks.
  • /scripts/vis stores the visualization created using the final Latent Dirichlet Allocation model, by employing the pyLDAvis package.
  • /API stores the source codes of the REST API.

Getting the data from the REST API

We have deployed in our servers a REST API to gather the data from the map in JSON format, the following endpoints are available:

  1. http://socialanalytics.bsc.es/biolitmap-api/biolitmap/list - To get the complete list of the data.
  2. http://socialanalytics.bsc.es/biolitmap-api/biolitmap/filter/source/"journal" - To filter by source (e.g. Oxford Bioinformatics).
  3. http://socialanalytics.bsc.es/biolitmap-api/biolitmap/filter/year/[+-]"year" - To filter by year (e.g. 2010), you can use the + and - symbols to query for greater-equal or less-equal, respectively (e.g. +2010 or -2010).
  4. http://socialanalytics.bsc.es/biolitmap-api/biolitmap/filter/nameAffiliation/"institution" - To filter by research institution (e.g. University of Cambridge).

Example output from the /list endpoint

Getting the raw source data from Scopus

In order to obtain the raw source data with which this application has been built on, the following steps need to be followed:

  1. Access to the Scopus (www.scopus.com) document search tool
  2. Search the documents by using the following query:

ISSN ( 'JOURNAL_ISSN' ) AND ( LIMIT-TO ( PUBYEAR , 2017 ) OR LIMIT-TO ( PUBYEAR , 2016 ) OR LIMIT-TO ( PUBYEAR , 2015 ) OR LIMIT-TO ( PUBYEAR , 2014 ) OR LIMIT-TO ( PUBYEAR , 2013 ) OR LIMIT-TO ( PUBYEAR , 2012 ) OR LIMIT-TO ( PUBYEAR , 2011 ) OR LIMIT-TO ( PUBYEAR , 2010 ) OR LIMIT-TO ( PUBYEAR , 2009 ) OR LIMIT-TO ( PUBYEAR , 2008 ) OR LIMIT-TO ( PUBYEAR , 2007 ) OR LIMIT-TO ( PUBYEAR , 2006 ) OR LIMIT-TO ( PUBYEAR , 2005 ) ) AND ( LIMIT-TO ( EXACTKEYWORD , "Article" ) )

  1. Export the documents in CSV format with the 'Export' option.

Journals

In this first version of the application, the articles from the following journals are extracted:

  1. Oxford Bioinformatics (ISSN 1460-2059)
  2. Nucleic Acids Research (ISSN 1362-4962)
  3. BMC Bioinformatics (ISSN 1471-2105)
  4. BMC Genomics (ISSN 1471-2164)
  5. PLoS Computational Biology (ISSN 1553-734X)

Contact

You can contact the developers by sending an email to [email protected] or [email protected]

Preview

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].