cmusam / fortune500

Licence: other

Fortune 500 company lists since 1955 in CSV format, mostly parsed using Beautiful Soup

Programming Languages

75241 projects

Projects that are alternatives of or similar to fortune500

Download financial reports from SEC's EDGAR. Extract clean textual data from specific item sections and bootstrap your financial NLP research. Software from the research paper published in ECONLP 2021.

Stars: ✭ 71 (-8.97%)

Mutual labels: finance, business, economics

priceR

Economics and Pricing in R

Stars: ✭ 32 (-58.97%)

Mutual labels: finance, economics

Akshare

AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库

Stars: ✭ 4,334 (+5456.41%)

Mutual labels: finance, economics

akshare

AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库

Stars: ✭ 5,155 (+6508.97%)

Mutual labels: finance, economics

Bootstrapping Calculator

Do you have enough savings to fund your business?

Stars: ✭ 465 (+496.15%)

Mutual labels: finance, business

Fecon235

Notebooks for financial economics. Keywords: Jupyter notebook pandas Federal Reserve FRED Ferbus GDP CPI PCE inflation unemployment wage income debt Case-Shiller housing asset portfolio equities SPX bonds TIPS rates currency FX euro EUR USD JPY yen XAU gold Brent WTI oil Holt-Winters time-series forecasting statistics econometrics

Stars: ✭ 708 (+807.69%)

Mutual labels: finance, economics

ESL

The Economic Simulation Library provides an extensive collection of tools to develop, test, analyse and calibrate economic and financial agent-based models. The library is designed to take advantage of different computer architectures. In order to facilitate rapid iteration during model development the library can use parallel computation. Econ…

Stars: ✭ 36 (-53.85%)

Mutual labels: finance, economics

Fecon236

Tools for financial economics. Curated wrapper over Python ecosystem. Source code for fecon235 Jupyter notebooks.

Stars: ✭ 72 (-7.69%)

Mutual labels: finance, economics

mxfactorial

a payment application intended for deployment by the united states treasury

Stars: ✭ 36 (-53.85%)

Mutual labels: finance, economics

ferrumfix

Financial Information eXchange protocol implemented in Rust

Stars: ✭ 192 (+146.15%)

Mutual labels: finance

PT-Tracking

Aplicação para registo e acompanhamento de encomendas da CTT Expresso, automatiza a consulta online do estado de tracking para várias remessas e mantém um registo dos pagamentos referentes aos envios à cobrança. As remessas que requerem atenção, devido a atrasos na entrega ou na receção do pagamento correspondente, bem como os cheques cuja data …

Stars: ✭ 18 (-76.92%)

Mutual labels: business

lecture-julia.myst

Lecture source for "Quantitative Economics with Julia"

Stars: ✭ 24 (-69.23%)

Mutual labels: economics

value-investing-studies

Data Analysis Studies on Value Investing

Stars: ✭ 66 (-15.38%)

Mutual labels: economics

niqlow

design, solve and estimate discrete dynamic programs.

Stars: ✭ 16 (-79.49%)

Mutual labels: economics

TradeRepublicApi

Unofficial trade republic API

Stars: ✭ 134 (+71.79%)

Mutual labels: finance

gobacktest

event-driven backtesting framework written in golang

Stars: ✭ 179 (+129.49%)

Mutual labels: finance

lakshmi

Investing library and command-line interface inspired by the Bogleheads philosophy

Stars: ✭ 107 (+37.18%)

Mutual labels: finance

fibo

The Financial Industry Business Ontology (FIBO) defines the sets of things that are of interest in financial business applications and the ways that those things can relate to one another. In this way, FIBO can give meaning to any data (e.g., spreadsheets, relational databases, XML documents) that describe the business of finance.

Stars: ✭ 204 (+161.54%)

Mutual labels: finance

kkndme

kkndme聊房，数据整理自天涯。提供HTML、PDF和Markdown三种形式。

Stars: ✭ 752 (+864.1%)

Mutual labels: finance

DCF

Basic Discounted Cash Flow library written in Python. Automatically fetches relevant financial documents for chosen company and calculates DCF based on specified parameters.

Stars: ✭ 198 (+153.85%)

Mutual labels: finance

View All Similar Projects ➔

Fortune 500 company lists (1955-2019)

Usage

The dataset is under the csv/ directory.

The Fortune 500 is an annual list compiled and published by Fortune magazine that ranks 500 of the largest United States corporations by total revenue for their respective fiscal years.

How is this dataset collected?

The data come from a variety of sources, as I failed to find a single complete dataset that contains all lists from 1955 to 2018.

2019-

I'll be manually updating them.

2019: https://fortune.com/fortune500/2019/search/

2015-2018

http://fortune.com/fortune500/2015/list only loads the top 20 companies. More rows can be loaded by scrolling down to page bottom.

On the webpage, open Developer Tools.
Scroll to page bottom to load the next 30 companies (ranked 21 through 50).
In the Network panel, you can find a request whose type is Fetch.
Right click on the request to reveal link http://fortune.com/api/v2/list/1141696/expand/item/ranking/asc/20/30
After inspecting, we find that /20/30 means skip 20 and take 30, equivalent to getting row 21 through row 50.
It seems this API gives at most 100 rows per call. So, we can access http://fortune.com/api/v2/list/1141696/expand/item/ranking/asc/0/100 to get the first 100 companies, and http://fortune.com/api/v2/list/1141696/expand/item/ranking/asc/100/100 to get the next 100, and so on.
Finally, use the Python json package to parse the JSON files, and build the CSV files.

Data source:

homepage for 2015: http://fortune.com/fortune500/2015/list
1-100: http://fortune.com/api/v2/list/1141696/expand/item/ranking/asc/0/100
101-200: http://fortune.com/api/v2/list/1141696/expand/item/ranking/asc/100/100
201-300: http://fortune.com/api/v2/list/1141696/expand/item/ranking/asc/200/100
301-400: http://fortune.com/api/v2/list/1141696/expand/item/ranking/asc/300/100
401-500: http://fortune.com/api/v2/list/1141696/expand/item/ranking/asc/400/100

2013-2014

The data are from FortuneChina.com, the official website of Fortune magazine for China.

Data source:

url_2013 = 'http://www.fortunechina.com/fortune500/c/2013-05/06/content_154796.htm'
url_2014 = 'http://www.fortunechina.com/fortune500/c/2014-06/02/content_207496.htm'

2006-2012

The data are scrapped manually from the sources below, because the HTML pages containing 2006-2012 data do not follow a uniform structure.

Data source:

base = 'https://money.cnn.com/magazines/fortune/fortune500/{}/full_list/{}.html'
pages = ('index', '101_200', '201_300', '301_400', '401_500')
urls = [base.format(year, page) for year in range(2006,2013) for page in pages]

1955-2005

HTML sources are downloaded using urllib, parsed using Beautiful Soup, and saved as CSV.

Data source:

base = 'https://money.cnn.com/magazines/fortune/fortune500_archive/full/{}/{}.html'
urls = [base.format(year, page) for year in range(1955,2006) for page in (1,101,201,301,401)]

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

cmusam / fortune500

Programming Languages

Labels

Projects that are alternatives of or similar to fortune500

Fortune 500 company lists (1955-2019)

Usage

How is this dataset collected?

2019-

2015-2018

2013-2014

2006-2012

1955-2005