Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → jiangtiantu → Factorhub

jiangtiantu / Factorhub

Labels

jupyter-notebook

Projects that are alternatives of or similar to Factorhub

Attack Datasources

This content is analysis and research of the data sources currently listed in ATT&CK.

Stars: ✭ 71 (-2.74%)

Mutual labels: jupyter-notebook

Raccoon dataset

The dataset is used to train my own raccoon detector and I blogged about it on Medium

Stars: ✭ 1,177 (+1512.33%)

Mutual labels: jupyter-notebook

Coursera Specializations

Solutions to assignments of Coursera Specializations - Deep learning, Machine learning, Algorithms & Data Structures, Image Processing and Python For Everybody

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

Ge tutorials

Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

Coordconv

Pytorch implementation of "An intriguing failing of convolutional neural networks and the CoordConv solution" - https://arxiv.org/abs/1807.03247

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

Pydata pandas

A PyData workshop on pandas

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

Cbe30338

Chemical Process Control

Stars: ✭ 71 (-2.74%)

Mutual labels: jupyter-notebook

Public

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

Ml Starter Pack

A collection of Machine Learning algorithms written from sctrach.

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

Cs231n

CS231n Convolutional Neural Networks for Visual Recognition (winter 2016) - Assignments

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

Viztech

Plotnine replication of Financial Times Visual Vocabulary; Inspired by Vega

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

Python Jupyter Apache Kafka Ksql Tensorflow Keras

Making Machine Learning Simple and Scalable with Python, Jupyter Notebook, TensorFlow, Keras, Apache Kafka and KSQL

Stars: ✭ 69 (-5.48%)

Mutual labels: jupyter-notebook

Mapclassify

Classification schemes for choropleth mapping.

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

Hacktoberfest2020

Contribute for hacktoberfest 2020

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

Visjs2jupyter

visJS2jupyter is a tool to bring the interactivity of networks created with vis.js into jupyter notebook cells

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

Zaoqi Data

公众号：可视化图鉴

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

Group Sparsity Sbp

Structured Bayesian Pruning, NIPS 2017

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

Prml

Some IPython notebooks based on Bishop's "Pattern Recognition and Machine Learning" book

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

Tensorflow Vgg

Re-implementation of VGG Network in tensorflow

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

Allstate capstone

Allstate Kaggle Competition ML Capstone Project

Stars: ✭ 72 (-1.37%)

Mutual labels: jupyter-notebook

View All Similar Projects ➔

factorhub 因子交换小组

factorhub这个小组可能只适合做过多因子组合或因子挖掘的朋友。

我做了大半年的因子挖掘，然后总结出一个规律，没有好的数据，没有丰富的因子库是做不出好的超额的。所以我有个野一点的想法。我想建立一个小组，大家互相认可的话，可以交换下自己手上的因子，你们自己组队，沟通。或者我来介绍沟通都行，平等自愿。我微信debin16

我这边的话，开源了自己的因子框架，从数据库，到因子分析，都开源了（虽然是个小辣鸡）。但因子定义文件，我想以交换的方式互相交流。我愿意拿自己手上的3个因子换对方1个因子。每个因子的分层曲线，和多空收益我放在了factorfig 文件夹里。大家想要哪些因子可以挑。互相认可，我们就交换。

千粉大佬们愿意帮我推荐分享的话，我也愿意把因子文件直接送给您；有大佬愿意一起参与这个项也非常欢迎，互通有无。

因子框架

我代码很烂，水平也差。大佬们有意见随意提，我后面学习了就改进，也怕自己误人子弟。

**1.data：**一个简陋的数据库，以hdf5文件保存。提供基础数据，用于因子计算，和回测计算收益。建议自己本地安装好quantaxis，即可自行下载数据。 https://github.com/QUANTAXIS/QUANTAXIS

2.factor: 计算好的因子数据，以pkl 文件保存，文件太大，我上传到了百度网盘链接: https://pan.baidu.com/s/1HcRxXkHZ6ytyx6UThR5tcg 提取码: cust

3.analysis：因子分析工具，目前只开源了两个功能，分层画图，和计算超额

具体流程是：

#读取数据
datapath='E:\\Users\\Desktop\\factorhub\\data\\'
factorpath='E:\\Users\\Desktop\\factorhub\\factors\\'

data_hfq=pd.read_hdf(datapath+'data_hfq.h5','data_hfq')
data_bfq=pd.read_hdf(datapath+'data_bfq.h5','data_bfq')

`#对数据进行基本的处理`
`Open     = data_hfq["open"].unstack()`
`Close    = data_hfq["close"].unstack()`
`High     = data_hfq["high"].unstack()`
`Low      = data_hfq["low"].unstack()`
`Vol      = data_hfq["volume"].unstack()`
`Amount   = data_hfq["amount"].unstack()`
`chg_1_d  = Close.pct_change()`
`stock_info=QA.QA_fetch_stock_info(code=Open.columns.to_list())`
`sz       = data_bfq['close'].unstack().mul(stock_info["zongguben"],axis=1)`
`ltsz     = data_bfq['close'].unstack().mul(stock_info["liutongguben"],axis=1)`
`vwap     = Amount/Vol/100`

#去除涨跌停，去除停牌股
tradeable=data_bfq['amount'].apply(lambda x :1 if x>0 else np.nan)*(data_bfq['high']-data_bfq['low']).apply(lambda x :1 if x!=0 else np.nan)*chg_1_d.stack().apply(lambda x :1 if x<0.100 else np.nan)
tradeable=tradeable.unstack()

#获取基准
Benchmark=QA.QA_fetch_index_day_adv('000905',tradeable.index[0],tradeable.index[-1]).close
Benchmark.index=(Benchmark.index).get_level_values(0)
Benchmark=(Benchmark.pct_change(1)).shift(-1)
megedata=pd.DataFrame()
# megedata["period"]=Close.pct_change(1).shift(-1).stack()#以收盘价交易
megedata["period"]=Open.pct_change().shift(-2).stack()#以开盘价交易

#定义一个因子
def factor_simple():
    factor=-1*Close.pct_change(5)
 return factor
test_factor=factor_simple()

#分层画图
test_factor=test_factor.replace([np.inf, -np.inf], np.nan)
clean_factor_data=megedata
input_factor= test_factor*tradeable
input_factor=input_factor.stack()
clean_factor_data["factor"]=input_factor
clean_factor_data=clean_factor_data.dropna()

clean_factor_data["factor_quantile"]=clean_factor_data["factor"].groupby(level=0).apply(lambda x :((pd.qcut(x.rank(), 10, labels=False,duplicates='drop') + 1)))
df_factor_quantile=clean_factor_data.reset_index().groupby(['date','factor_quantile'])["period"].mean().unstack().cumsum()
df_factor_quantile.plot(figsize=(16,9),title="test_factor")

#不算复利,计算对冲收益
group_num=10
commision_fee=0.0

test_factor=test_factor.replace([np.inf, -np.inf], np.nan)
clean_factor_data=megedata
input_factor= test_factor*tradeable
input_factor=input_factor.stack()


clean_factor_data["factor"]=input_factor
clean_factor_data=clean_factor_data.dropna()
clean_factor_data["factor_quantile"]=clean_factor_data["factor"].groupby(level=0).apply(lambda x :((pd.qcut(x.rank(), 10, labels=False,duplicates='drop') + 1)))

long_portfolio_data = clean_factor_data[clean_factor_data['factor_quantile'] == group_num]
short_portfolio_data = clean_factor_data[clean_factor_data['factor_quantile'] == 1]

long_portfolio_rate_of_return = long_portfolio_data['period'].mean(level=0) - commision_fee
short_portfolio_rate_of_return = short_portfolio_data['period'].mean(level=0) - commision_fee
hedged_rate_of_return = long_portfolio_rate_of_return - short_portfolio_rate_of_return - 2 * commision_fee
hedged_with_Benchmark_return = long_portfolio_rate_of_return - Benchmark - commision_fee

long_cumulative_return = 1+long_portfolio_rate_of_return.cumsum()
short_cumulative_return = 1+short_portfolio_rate_of_return.cumsum()
hedged_cumulative_return = 1+hedged_rate_of_return.cumsum()
Benchmark_cumulative_return = 1+Benchmark.cumsum()
hedged_with_Benchmark_cumulative_return = 1+hedged_with_Benchmark_return.cumsum()

Return = pd.concat([long_cumulative_return,short_cumulative_return, hedged_cumulative_return, Benchmark_cumulative_return,hedged_with_Benchmark_cumulative_return], axis=1)
Return.columns = ['long','short','long-short','benchmark','long-benchmark']

Return=Return.dropna()
Return.plot(figsize=(16,9),title="test—factor")

基本上你自己定义一个因子，之后就直接开始研究了。我这个框架是学习alphalens 写的，因为alphalens 太慢了，所以，就自己实现了，要快些。没有做任何封装，理解起来容易些。虽然代码懒，但大概的步骤是没有错的，所有曲线没有计算手续费，没有计算对冲成本。

4.factor_born: 因子自动生成算法，基于deap，暂未开源 5.factor_fig: 因子分层曲线和超额收益曲线（全部按照单利计算） **6.mfm_operator：**一个算子文件，定义了些常见的算子

And More ?

欢迎加入quanthub 社区

https://zhuanlan.zhihu.com/p/148087260

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 73

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗