The source code of the this is licensed under the Apache 2.0 license.
Summary of Terms
Attribution: You must give appropriate credit, provide a link to the license, and indicate if changes were made.
NonCommercial: You may not use the material for commercial purposes.
ShareAlike: If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original.
License Badge
5. Citation
If you’d like to use our benchmark or cite this paper, please kindly use the reference below:
@article{wu2024mobilevlm,
title={Mobilevlm: A vision-language model for better intra-and inter-ui understanding},
author={Wu, Qinzhuo and Xu, Weikai and Liu, Wei and Tan, Tao and Liu, Jianfeng and Li, Ang and Luan, Jian and Wang, Bin and Shang, Shuo},
journal={arXiv preprint arXiv:2409.14818},
year={2024}
}
MobileVLM
Paper Link
MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding
News
1. Quick Start
Requirements
2. Mobile3M Dataset
Training Data
Training data is available at the following link: data. We will gradually upload data for all apps.
Training Json
Training json files are available at the following link: training_jsons.
Corpus Collection Script
To start collecting data, run the script
main/corpus/googleCreatDataset/arm_graph_para_lock.py.Example usage:
Running the above collection instruction requires the following additional installations.
Install Node.js and Appium:
Install graphical libraries:
sudo apt-get install xorgActivate the Python virtual environment:
source /path/to/new/virtual/environment/bin/activateInstall Appium Python Client 1.3.0:
pip install Appium-Python-Client==1.3.0Parameter Descriptions
-1for no recheck.Appium
Data Generation Code for Each Task
The code for generating data for each task can be found in the following directories:
Our Test Data
Our test data is available at data.
4. License
The dataset of this project is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) license.
The source code of the this is licensed under the Apache 2.0 license.
Summary of Terms
License Badge
5. Citation
If you’d like to use our benchmark or cite this paper, please kindly use the reference below: