Motivation
Existing Chinese image-text datasets (Zero, Wukong) are more than three years old and suffer from two critical issues:
- Temporal Irrelevance: They lack contemporary concepts that emerged after they were collected
- Dead Links: A high proportion of their image URLs are no longer valid
DanQing: A Modern Solution
DanQing provides ~100M image-text pairs from 2024-2025 web data.
The dataset is open-sourced under the CC-BY 4.0 license, providing a foundation for next-generation Chinese AI models.