All Projects → Yale-LILY → ConvoSumm

Yale-LILY / ConvoSumm

Licence: CC-BY-SA-4.0 license
No description, website, or topics provided.

Programming Languages

python
139335 projects - #7 most used programming language
Jupyter Notebook
11667 projects
shell
77523 projects
Cuda
1817 projects
javascript
184084 projects - #8 most used programming language
C++
36643 projects - #6 most used programming language

ConvoSumm

Data, code, and model checkpoints for the ACL 2021 paper ConvoSumm: Conversation Summarization Benchmark and Improved Abstractive Summarization with Argument Mining!

Data

The data can be accessed from this Google Drive link.

The data-non-processed contains the original, non-processed data and is 27MB, while data-processed contains the data for vanilla, -arg-filtered, and -arg-graph experiments, as well as model outputs, and is 611 MB.

Using the gdrive cli, download the folders with the following command

gdrive download --recursive 1HfyCMa1fQ5DkzME9RQZkytZQfyDjE1EK

The data can also be downloaded from this S3 bucket.

aws s3 cp --recursive s3://convosumm/data/ ./data

Code and Model Checkpoints

Please see this README for code details.

Model checkpoints can be downloaded from the S3 bucket (~80GB) or from this Google Drive link:

aws s3 cp --recursive s3://convosumm/checkpoints/ ./checkpoints
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].