All Projects → cmusam → fortune500

cmusam / fortune500

Licence: other
Fortune 500 company lists since 1955 in CSV format, mostly parsed using Beautiful Soup

Programming Languages

HTML
75241 projects

Projects that are alternatives of or similar to fortune500

edgar-crawler
Download financial reports from SEC's EDGAR. Extract clean textual data from specific item sections and bootstrap your financial NLP research. Software from the research paper published in ECONLP 2021.
Stars: ✭ 71 (-8.97%)
Mutual labels:  finance, business, economics
priceR
Economics and Pricing in R
Stars: ✭ 32 (-58.97%)
Mutual labels:  finance, economics
Akshare
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 4,334 (+5456.41%)
Mutual labels:  finance, economics
akshare
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 5,155 (+6508.97%)
Mutual labels:  finance, economics
Bootstrapping Calculator
Do you have enough savings to fund your business?
Stars: ✭ 465 (+496.15%)
Mutual labels:  finance, business
Fecon235
Notebooks for financial economics. Keywords: Jupyter notebook pandas Federal Reserve FRED Ferbus GDP CPI PCE inflation unemployment wage income debt Case-Shiller housing asset portfolio equities SPX bonds TIPS rates currency FX euro EUR USD JPY yen XAU gold Brent WTI oil Holt-Winters time-series forecasting statistics econometrics
Stars: ✭ 708 (+807.69%)
Mutual labels:  finance, economics
ESL
​The Economic Simulation Library provides an extensive collection of tools to develop, test, analyse and calibrate economic and financial agent-based models. The library is designed to take advantage of different computer architectures. In order to facilitate rapid iteration during model development the library can use parallel computation. Econ…
Stars: ✭ 36 (-53.85%)
Mutual labels:  finance, economics
Fecon236
Tools for financial economics. Curated wrapper over Python ecosystem. Source code for fecon235 Jupyter notebooks.
Stars: ✭ 72 (-7.69%)
Mutual labels:  finance, economics
mxfactorial
a payment application intended for deployment by the united states treasury
Stars: ✭ 36 (-53.85%)
Mutual labels:  finance, economics
ferrumfix
Financial Information eXchange protocol implemented in Rust
Stars: ✭ 192 (+146.15%)
Mutual labels:  finance
PT-Tracking
Aplicação para registo e acompanhamento de encomendas da CTT Expresso, automatiza a consulta online do estado de tracking para várias remessas e mantém um registo dos pagamentos referentes aos envios à cobrança. As remessas que requerem atenção, devido a atrasos na entrega ou na receção do pagamento correspondente, bem como os cheques cuja data …
Stars: ✭ 18 (-76.92%)
Mutual labels:  business
lecture-julia.myst
Lecture source for "Quantitative Economics with Julia"
Stars: ✭ 24 (-69.23%)
Mutual labels:  economics
value-investing-studies
Data Analysis Studies on Value Investing
Stars: ✭ 66 (-15.38%)
Mutual labels:  economics
niqlow
design, solve and estimate discrete dynamic programs.
Stars: ✭ 16 (-79.49%)
Mutual labels:  economics
TradeRepublicApi
Unofficial trade republic API
Stars: ✭ 134 (+71.79%)
Mutual labels:  finance
gobacktest
event-driven backtesting framework written in golang
Stars: ✭ 179 (+129.49%)
Mutual labels:  finance
lakshmi
Investing library and command-line interface inspired by the Bogleheads philosophy
Stars: ✭ 107 (+37.18%)
Mutual labels:  finance
fibo
The Financial Industry Business Ontology (FIBO) defines the sets of things that are of interest in financial business applications and the ways that those things can relate to one another. In this way, FIBO can give meaning to any data (e.g., spreadsheets, relational databases, XML documents) that describe the business of finance.
Stars: ✭ 204 (+161.54%)
Mutual labels:  finance
kkndme
kkndme聊房,数据整理自天涯。提供HTML、PDF和Markdown三种形式。
Stars: ✭ 752 (+864.1%)
Mutual labels:  finance
DCF
Basic Discounted Cash Flow library written in Python. Automatically fetches relevant financial documents for chosen company and calculates DCF based on specified parameters.
Stars: ✭ 198 (+153.85%)
Mutual labels:  finance

Fortune 500 company lists (1955-2019)

Usage

The dataset is under the csv/ directory.

The Fortune 500 is an annual list compiled and published by Fortune magazine that ranks 500 of the largest United States corporations by total revenue for their respective fiscal years.

How is this dataset collected?

The data come from a variety of sources, as I failed to find a single complete dataset that contains all lists from 1955 to 2018.

2019-

I'll be manually updating them.

2015-2018

http://fortune.com/fortune500/2015/list only loads the top 20 companies. More rows can be loaded by scrolling down to page bottom.

  1. On the webpage, open Developer Tools.
  2. Scroll to page bottom to load the next 30 companies (ranked 21 through 50).
  3. In the Network panel, you can find a request whose type is Fetch.
  4. Right click on the request to reveal link http://fortune.com/api/v2/list/1141696/expand/item/ranking/asc/20/30
  5. After inspecting, we find that /20/30 means skip 20 and take 30, equivalent to getting row 21 through row 50.
  6. It seems this API gives at most 100 rows per call. So, we can access http://fortune.com/api/v2/list/1141696/expand/item/ranking/asc/0/100 to get the first 100 companies, and http://fortune.com/api/v2/list/1141696/expand/item/ranking/asc/100/100 to get the next 100, and so on.
  7. Finally, use the Python json package to parse the JSON files, and build the CSV files.

Data source:

2013-2014

The data are from FortuneChina.com, the official website of Fortune magazine for China.

Data source:

url_2013 = 'http://www.fortunechina.com/fortune500/c/2013-05/06/content_154796.htm'
url_2014 = 'http://www.fortunechina.com/fortune500/c/2014-06/02/content_207496.htm'

2006-2012

The data are scrapped manually from the sources below, because the HTML pages containing 2006-2012 data do not follow a uniform structure.

Data source:

base = 'https://money.cnn.com/magazines/fortune/fortune500/{}/full_list/{}.html'
pages = ('index', '101_200', '201_300', '301_400', '401_500')
urls = [base.format(year, page) for year in range(2006,2013) for page in pages]

1955-2005

HTML sources are downloaded using urllib, parsed using Beautiful Soup, and saved as CSV.

Data source:

base = 'https://money.cnn.com/magazines/fortune/fortune500_archive/full/{}/{}.html'
urls = [base.format(year, page) for year in range(1955,2006) for page in (1,101,201,301,401)]
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].