Large-scale AMR graph dataset of more than 2 million sentences in the computational linguistics / natural language processing domain.
Download the data (.txt.gz files):
For details about the creation of the data, refer to the following paper:
M. Zhao, Y. Wang, and Y. Lepage. Large-scale AMR corpus with re-generated sentences: domain adaptive pre-training on ACL Anthology Corpus. In Proceedings of the 14th International Conference on Advanced Computer Science and Information Systems (ICACSIS 2022), pages ??–??, 2022.
If you use the data, please quote the above paper.