The Chinese-LiPS dataset is a multimodal audio-visual speech recognition (AVSR) dataset. The dataset includes:
Split | Duration (hours) | Segment | Speaker |
---|---|---|---|
Train | 85.37 | 30,341 | 175 |
Validation | 5.35 | 1,959 | 11 |
Test | 10.12 | 3,908 | 21 |
All | 100.84 | 36,208 | 207 |
You can access the dataset using the links below:
The dataset includes three main modalities: audio, slide video, and lip-reading video. The dataset is organized into several files:
train.zip/test.zip/val.zip ├── ID1_age_gender_topic/ │ ├── WAV/ │ │ ├── ID1_age_gender_topic_001.json # Audio segment annotation file │ │ ├── ID1_age_gender_topic_001.wav # Audio file (48 kHz) │ ├── PPT/ │ │ ├── ID1_age_gender_topic_001_PPT.mp4 # Slide video file (1080p 30fps) │ ├── FACE/ │ │ ├── ID1_age_gender_topic_001_FACE.mp4 # Lip-reading video file (720p 30fps) ├── ID2_age_gender_topic/ │ ├── WAV/ │ │ ├── ID2_age_gender_topic_001.json │ │ ├── ID2_age_gender_topic_001.wav │ ├── PPT/ │ │ ├── ID2_age_gender_topic_001_PPT.mp4 │ ├── FACE/ │ │ ├── ID2_age_gender_topic_001_FACE.mp4 ├── ...
asr.zip ├── topic1/ │ ├── ID1_age_gender_topic/ │ │ ├── WAV/ │ │ │ ├── ID1_age_gender_topic_001.json │ │ │ ├── ID1_age_gender_topic_001.wav │ │ ├── PPT/ │ │ │ ├── ID1_age_gender_topic_001_PPT.mp4 │ │ ├── FACE/ │ │ │ ├── ID1_age_gender_topic_001_FACE.mp4 │ ├── ID2_age_gender_topic/ │ │ ├── ... ├── topic2/ │ ├── ...
The TOPIC field is abbreviated in Chinese as follows: DZJJ = E-sports & Gaming, JKYS = Health & Wellness, KJ = Science & Technology, LY = Travel & Exploration, QC = Automobile & Industry, RWLS = Culture & History, TY = Sports & Competitions, YS = Movies & TV Series, ZX = Others.
@misc{zhao2025chineselipschineseaudiovisualspeech, title={Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides}, author={Jinghua Zhao and Yuhang Jia and Shiyao Wang and Jiaming Zhou and Hui Wang and Yong Qin}, year={2025}, eprint={2504.15066}, archivePrefix={arXiv}, primaryClass={cs.MM}, url={https://arxiv.org/abs/2504.15066} }
If you have any questions or suggestions, feel free to reach out via email at zhao1jing1hua@gmail.com.