OpenDataLab

Team
non-profit
https://opendatalab.com/
OpenDataLab_AI
opendatalab

AI & ML interests

OpenDataLab provides high-quality open datasets and tools for large models. It is the designated open-source data service platform of the China Large Model Corpus Data Alliance.

Recent Activity

Carkham  updated a Space 2 days ago
opendatalab/TRivia-3B
qiujiantao  authored a paper 3 days ago
Unsupervised Topic Models are Data Mixers for Pre-training Language Models
qiujiantao  authored a paper 3 days ago
AICC: Parse HTML Finer, Make Models Better -- A 7.3T AI-Ready Corpus Built by a Model-Based HTML Parser

Papers

Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights

AICC: Parse HTML Finer, Make Models Better -- A 7.3T AI-Ready Corpus Built by a Model-Based HTML Parser


Team members: Ren Ma, focus, Bin Wang, wufan, Qiu Jiantao, Lijun Wu, MA Runyuan, junyuan, Tianyao He, Zheng Liu, Wayne, Linke Ouyang, xiaomeng zhao, Haojiong Chen, yuan, cxz

opendatalab's Spaces (6)

  • MinerU OCR 📚 (pinned, running on L40S, 496 likes, updated 7 days ago)
    A data extraction tool to convert PDF to Markdown and JSON

  • TRivia-3B ⭐ (running on Zero, 2 likes, updated 2 days ago)
    Convert table images into HTML tags with TRivia-3B

  • CDM 📈 (running, 8 likes, updated Sep 28)
    Evaluate formula recognition accuracy

  • DocLayout YOLO 🚀 (running, featured, 178 likes, updated Sep 8)
    Demo for DocLayout-YOLO

  • README 🏃 (running, updated Apr 28)

  • UniMERNet 👁 (build error, 13 likes, updated Sep 19, 2024)
    Recognize math equations from images