
Data-Processing-Toolkit-for-LLMs


Overview

The Data Processing Toolkit for LLMs, published by Zhejiang Lab, provides tools for collecting and processing the data used to train LLMs. It is designed to address the challenges of data preparation across the diverse domains of LLM training, and aims to help researchers prepare data more efficiently and reduce the cost of dataset construction.

The data processing toolkit released in the current version includes:

Acknowledgement

If you use this toolkit in your research, please cite it as follows:

__special_katext_id_1__

If you have published research that uses this toolkit, please let us know. We will maintain a list of relevant publications to facilitate communication among researchers.

Contact us

If you have any problems using the toolkit, please contact us via email at Zhejiang Lab

© 2024 Research Center for Intelligent Equipment of Zhejiang Lab
