All Projects → SystemsLab-Sapienza → pump-and-dump-dataset

SystemsLab-Sapienza / pump-and-dump-dataset

Licence: MIT License
Additional material for paper: Pump and Dumps in the Bitcoin Era: Real Time Detection of Cryptocurrency Market Manipulations, ICCCN '20

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to pump-and-dump-dataset

binance-pump-bot
Automation for Binance p&d(pump and dump) activity, ensures fastest purchase and provides auto selling functionality to lockdown profit during these events.
Stars: ✭ 112 (+69.7%)
Mutual labels:  pump, binance
Bitprophet
Node crypto trading platform for Binance exchange.
Stars: ✭ 166 (+151.52%)
Mutual labels:  telegram, binance
PumpBot
A bot to use in a pump & dump event on Binance.com
Stars: ✭ 185 (+180.3%)
Mutual labels:  pump, binance
Trading Server
A multi-asset, multi-strategy, event-driven trade execution and management platform for running many algorithms/bots at many venues simultaneously with unified risk management and reporting. Uses MongoDB for storage and Telegram for user notifications/trade consent.
Stars: ✭ 191 (+189.39%)
Mutual labels:  telegram, binance
binance-pump-alerts
Tracks prices of pairs on binance and notifies when price movements based on pre-defined parameters are met.
Stars: ✭ 65 (-1.52%)
Mutual labels:  pump, binance
greed
A customizable Telegram shop bot that accepts bitcoin payments
Stars: ✭ 31 (-53.03%)
Mutual labels:  telegram
vldc-bot
The official VLDC telegram group bot
Stars: ✭ 23 (-65.15%)
Mutual labels:  telegram
open-trading-platform-API
The Open Trading Platform API is an independent module for managing API requests from the UI module
Stars: ✭ 17 (-74.24%)
Mutual labels:  binance
Harmonbot
Multi-Platform Factotum Bot
Stars: ✭ 30 (-54.55%)
Mutual labels:  telegram
BIRL
BIRL: Benchmark on Image Registration methods with Landmark validations
Stars: ✭ 66 (+0%)
Mutual labels:  dataset
binance-chain-python
Binance chain SDK in Python
Stars: ✭ 22 (-66.67%)
Mutual labels:  binance
Awesome-Deepfakes-Detection
A list of tools, papers and code related to Deepfake Detection.
Stars: ✭ 30 (-54.55%)
Mutual labels:  dataset
FileStreamBot
Telegram File to Link Fastest Bot , Its Generate Direct Links Quickly
Stars: ✭ 99 (+50%)
Mutual labels:  telegram
TGPy
Run Python code right in your Telegram messages
Stars: ✭ 14 (-78.79%)
Mutual labels:  telegram
mutegram
Disable Telegram Desktop Taskbar Flashing
Stars: ✭ 26 (-60.61%)
Mutual labels:  telegram
snorkeling
Extracting biomedical relationships from literature with Snorkel 🏊
Stars: ✭ 56 (-15.15%)
Mutual labels:  dataset
ACVR2017
An Innovative Salient Object Detection Using Center-Dark Channel Prior
Stars: ✭ 20 (-69.7%)
Mutual labels:  dataset
pull facebook data for good
[DEPRECATED] Imitate an API for downloading data from Facebook Data For Good
Stars: ✭ 12 (-81.82%)
Mutual labels:  dataset
efb-wechat-docker
EFB WeChat Slave Docker Ver.
Stars: ✭ 48 (-27.27%)
Mutual labels:  telegram
Telega
C# Telegram MTProto Client
Stars: ✭ 38 (-42.42%)
Mutual labels:  telegram

Pump and dump dataset

This repository contains an extended version of the dataset used for the paper:

Pump and Dumps in the Bitcoin Era: Real Time Detection of Cryptocurrency Market Manipulations

If you use this dataset, or use the findings from the paper, please cite:

@INPROCEEDINGS{9209660,
  author={M. {La Morgia} and A. {Mei} and F. {Sassi} and J. {Stefa}},
  booktitle={2020 29th International Conference on Computer Communications and Networks (ICCCN)}, 
  title={Pump and Dumps in the Bitcoin Era: Real Time Detection of Cryptocurrency Market Manipulations}, 
  year={2020},
  pages={1-9},
  doi={10.1109/ICCCN49398.2020.9209660}
  }

The dataset

The dataset contains a list of pump and dumps arranged by groups on Telegram. See the paper for a more detailed description of the dataset generation process. The pump events are listed inside the pump_telegram.csv file.

The pump and dumps file (pump_telegram.csv)

Each row of this file contains:

  • symbol: the symbol (SYM) of the pumped coin.
  • group: the code of the group that arranged the pump and dump. More information about the groups can the found in the group.csv file
  • date: the pump and dump date
  • hour: the pump and dump hour expressed in UTC
  • exchange: the exchange targeted by the group

All the pump and dumps in the dataset are on the trading pair SYM/BTC.

We provide a script to download the transactions from the Binance exchange that we used to train the machine learning model. Each transaction contains the following fields, check the Binance documentation for further information:

  • timestamp: the timestamp of the transaction
  • datetime: the datetime of the transaction
  • side: indicates the type of the transaction: sell or buy
  • price: the trading price
  • amount: the amount of money traded
  • btc_volume: the trading volume expressed in BTC

The group file (group.csv)

  • group_name: The name of the pump and dump group
  • group_code: Abbreviation used in the dataset for the group
  • last_time_checked: Last time we retrieved the pump and dumps arranged by the group from the Telegram channel
  • telegram_link: Link to the Telegram channel of the group

Contribution

Your contribution is very welcome!

If you want to help us maintain the dataset updated, feel free to create a pull request.

Please, make sure to fill all the fields of the pump_telegram.csv in your pull request. If you add pump and dumps arranged by groups that are not in the group.csv file, please update this file as well.

We will review your pull request and merge it on the master branch or contact you for additional information.

Installation

Clone this repository and run:

pip3 install -r requirements.txt

To download all the transactions of the pump and dumps carried out on Binance run the downloader.py script.

python3 downloader.py

To compute the features for the machine learning model, run the features.py script.

python3 features.py

It generates 3 features files, one for each chunk size (5, 15, 25 secs). The features are stored in csv files contained in the features folder.

The computed features are the following:

  • StdRushOrders and AvgRushOrders: : Moving standard deviation and average of the number of rush orders in each chunk of the moving window.

  • StdTrades: Moving standard deviation of the number of trades.

  • StdVolumes and AvgVolumes: Moving standard deviation and average of volume of trades in each chunk of the moving window.

  • StdPrice and AvgPrice: Moving standard deviation and average of closing price.

  • AvgPriceMax : Moving average of maximal and minimum price in each chunk.

During our analysis, we found that in some cases, pump and dumps started before or after the organizer shared the signal. To account for this discrepancy, we look into the trading data and manually flag the chunk when the pump and dump actually started. For this reason, the labeled chunk may be up to 120 seconds before the time reported in the pump_telegram.csv file. This case is usually due to a pre-pump on the targeted coin by the admin. We provide the labeled features in the labeled features folder.

In order to execute our classifier on our labeled features run the classifier.py file

python3 classifier.py

Donation

If you enjoyed our work, you can buy us a cup of coffee donating on this Zcash wallet:

zs1uw83xkwr2rl3hrphxy0sdpnvlgusy9au940af5423f06ntxev6qqzu7hqwhmn5td5a035jzczt9

zs1uw83xkwr2rl3hrphxy0sdpnvlgusy9au940af5423f06ntxev6qqzu7hqwhmn5td5a035jzczt9

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].