DataMate All-in-One Data Work Platform
DataMate is an enterprise-level data processing platform for model fine-tuning and retrieval-augmented generation (RAG). Its core functions include data collection, data management, an operator marketplace, data cleaning, data synthesis, data annotation, data evaluation, and knowledge generation.
Simplified Chinese (简体中文) | English
If you like this project, please give it a Star⭐️!
🌟 Core Features
🚀 Quick Start
Prerequisites
Docker Quick Deploy
Clone the Code
Deploy the basic services
This project supports two deployment methods: docker-compose and helm. After running the command, enter the number that corresponds to the deployment method you want. The command output is as follows:
If make is not installed on your machine, run the following command to deploy instead:
Once the container is running, access http://localhost:30000 in a browser to view the front-end interface.
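Startup can take a while, so it may help to check from a script that the front end is answering before opening the browser. The following is a minimal sketch, not part of DataMate: the helper name `wait_for_frontend`, the 60-second timeout, and the 2-second polling interval are all assumptions. Only the URL comes from the step above.

```python
import http.client
import time
import urllib.error
import urllib.request


def wait_for_frontend(url: str = "http://localhost:30000",
                      timeout_s: float = 60.0,
                      interval_s: float = 2.0) -> bool:
    """Poll `url` until it returns any HTTP response or `timeout_s` elapses.

    Hypothetical helper: the name and default timings are illustrative only.
    """
    # An empty ProxyHandler bypasses any proxy configured in the environment,
    # which is usually what you want for a service on localhost.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    deadline = time.monotonic() + timeout_s
    while True:
        try:
            with opener.open(url, timeout=5):
                return True
        except urllib.error.HTTPError:
            # The server answered with a non-2xx status, so it is up.
            return True
        except (urllib.error.URLError, http.client.HTTPException, OSError):
            pass  # connection refused, reset, or timed out: not ready yet
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval_s)
```

Calling `wait_for_frontend()` with no arguments returns `True` once http://localhost:30000 responds and `False` if nothing answers within the timeout.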
To list all available Make targets, flags and help text, run:
If you need to deploy in an offline environment, you can run the following command to download all required images:
Deploy Label Studio as an annotation tool
Build and deploy MinerU enhanced PDF processing
Deploy the DeerFlow service
Local Development and Deployment
After modifying the local code, please execute the following commands to build the image and deploy using the local image.
Uninstall
When running make uninstall, the installer will prompt once whether to delete volumes; that single choice is applied to all components. The uninstall order is: milvus -> label-studio -> datamate, which ensures the datamate network is removed cleanly after services that use it have stopped.
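The ordering rule above can be sketched as a small model. This is not the project's Makefile logic: `UNINSTALL_ORDER` and `uninstall_plan` are hypothetical names used only to show how one answer to the volume prompt fans out to every component, and why datamate comes last.

```python
from typing import List, Tuple

# Order stated above: datamate is removed last, so its network is no longer
# in use by milvus or label-studio when it is torn down.
UNINSTALL_ORDER = ("milvus", "label-studio", "datamate")


def uninstall_plan(delete_volumes: bool) -> List[Tuple[str, bool]]:
    """Return (component, delete_volumes) pairs in teardown order.

    Hypothetical helper: the single prompt answer is applied to all components.
    """
    return [(name, delete_volumes) for name in UNINSTALL_ORDER]


def describe(plan: List[Tuple[str, bool]]) -> List[str]:
    """Render the plan as human-readable steps (a dry run, nothing is removed)."""
    return [
        f"{step}. remove {name} ({'delete' if drop else 'keep'} volumes)"
        for step, (name, drop) in enumerate(plan, start=1)
    ]
```

For example, `describe(uninstall_plan(False))` lists three steps that keep volumes, ending with datamate.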
📚 Documentation
Core Documentation
Backend Documentation
Runtime Documentation
Frontend Documentation
🤝 Contribution Guidelines
Thank you for your interest in this project! We warmly welcome contributions from the community. Whether you submit bug reports, suggest new features, or take part directly in code development, every form of help makes the project better.
• 📮 GitHub Issues: Submit bugs or feature suggestions.
• 🔧 GitHub Pull Requests: Contribute code improvements.
📄 License
DataMate is open source under the MIT license. You are free to use, modify, and distribute the code of this project in compliance with the license terms.