tongzhugroup / mddatasetbuilder

License: LGPL-3.0
A script to build reference datasets for training neural network potentials from given LAMMPS trajectories.

Programming Languages

Python, Cython, C++

MDDatasetBuilder

MDDatasetBuilder is a script to construct reference datasets for the training of neural network potentials from given LAMMPS trajectories.

Complex reaction processes in combustion unraveled by neural network-based molecular dynamics simulation, Nature Communications, 11, 5713 (2020), DOI: 10.1038/s41467-020-19497-z

Author: Jinzhe Zeng

Email: [email protected]

Installation

MDDatasetBuilder can be installed with pip:

pip install mddatasetbuilder

Installation should take only a few minutes on a typical desktop computer.

Usage

Simple example

A LAMMPS dump file should be prepared. A LAMMPS bond file can optionally be added to provide additional information.

datasetbuilder -d dump.ch4 -b bonds.reaxc.ch4_new -a C H O -n ch4 -i 25

Here, dump.ch4 (-d) is the name of the dump file, and bonds.reaxc.ch4_new (-b) is the name of the optional bond file. C H O (-a) are the elements in the trajectory, and ch4 (-n) is the name of the dataset. 25 (-i) is the time step interval; its default value is 1.
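For scripted runs, the same command line can be assembled in Python. The following is a minimal sketch, not part of MDDatasetBuilder: the helper name build_datasetbuilder_command and its parameters are hypothetical, and it uses only the flags shown in the example above.

```python
import shlex


def build_datasetbuilder_command(dump, elements, name, bond=None, interval=1):
    """Assemble the datasetbuilder argument list (hypothetical helper).

    Only the flags from the example above are used: -d, -b, -a, -n, -i.
    """
    cmd = ["datasetbuilder", "-d", dump]
    if bond is not None:
        # The bond file is optional, so -b is added only when it is given.
        cmd += ["-b", bond]
    cmd += ["-a", *elements, "-n", name, "-i", str(interval)]
    return cmd


cmd = build_datasetbuilder_command(
    dump="dump.ch4",
    elements=["C", "H", "O"],
    name="ch4",
    bond="bonds.reaxc.ch4_new",
    interval=25,
)
# Show the command as it would be typed in a shell.
print(shlex.join(cmd))
```

To actually run it, the list can be passed to subprocess.run(cmd, check=True), which assumes datasetbuilder is on the PATH.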

Then you can generate Gaussian input files for each structure in the dataset and calculate the potential energy and atomic forces (this assumes Gaussian 16 is already installed):

qmcalc -d dataset_ch4_GJf/000
qmcalc -d dataset_ch4_GJf/001
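When the dataset has many subdirectories, the qmcalc calls above can be generated in a loop. This is a sketch under the assumption that dataset_ch4_GJf contains one subdirectory per batch (only 000 and 001 appear above); the helper name qmcalc_commands is hypothetical and not part of the package.

```python
from pathlib import Path


def qmcalc_commands(dataset_dir):
    """Return one `qmcalc -d <subdir>` argument list per subdirectory.

    Hypothetical helper: it assumes every subdirectory of dataset_dir
    is a batch that qmcalc should process, as with 000 and 001 above.
    """
    root = Path(dataset_dir)
    # Sort so that batches are processed in a stable order (000, 001, ...).
    subdirs = sorted(p for p in root.iterdir() if p.is_dir())
    return [["qmcalc", "-d", str(p)] for p in subdirs]


# Usage (requires the dataset directory and qmcalc on the PATH):
#   import subprocess
#   for cmd in qmcalc_commands("dataset_ch4_GJf"):
#       subprocess.run(cmd, check=True)
```

Running the batches one after another keeps the sketch simple; on a cluster, each command would more likely be submitted as its own job.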

Next, prepare a DeePMD dataset and use DeePMD-kit to train a neural network model.

preparedeepmd -p dataset_ch4_GJf
cd train && dp train train.json

The runtime of the software depends on the amount of data. It is better suited to running on a server than on a desktop computer.

DP-GEN

DOI: 10.1021/acs.energyfuels.0c03211

In follow-up work, the MDDatasetBuilder package was integrated into the DP-GEN software as part of the DP-GEN workflow:

dpgen init_reaction reaction.json machine.json

See the DP-GEN documentation for details. Arguments of reaction.json can be found here. machine.json is described here, where reaxff_command is the LAMMPS command (lmp), build_command is the MDDatasetBuilder command (datasetbuilder), and fp_command is the Gaussian 16 command (g16 < input || :).

The generated data can be used to continue the DP-GEN concurrent learning workflow. Read Energy & Fuels, 2021, 35 (1), 762–769 for details.
