Instructions to use cy0307/ropedia-xperience-10m-task-baselines with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Cosmos
How to use cy0307/ropedia-xperience-10m-task-baselines with Cosmos:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- Notebooks
- Google Colab
- Kaggle
Xperience-10M Multi-Episode Data Access Status
This file summarizes what is needed before the Qwen3-Omni pilot becomes a real held-out multi-episode experiment.
Current State
| Item | Status |
|---|---|
| Minimum pilot gate | 32 valid Xperience-10M episodes |
| Current public local sample | 1 episode |
| Full dataset access | Granted; metadata-only Hugging Face audit completed |
| Current full-dataset metadata snapshot | 12,102 complete visible HF episodes across 802 complete sessions |
| Current selected pilot | 128 source-balanced episodes, 96/16/16 train/val/test |
| Current multi-episode data state | Selected 128-episode pilot package is verified for the exported subset: 3,808 windows across train/val/test, with 448 held-out test windows |
| Current Qwen3-Omni artifacts | Verified diagnostic held-out pilot; JSON validity is 87.50%, below the 98% target |
| Public raw-data redistribution | Not included |
The selected 128-episode pilot is now a completed diagnostic pipeline result, not a strong model-quality result. It verifies the data export, LoRA training, validation monitoring, held-out evaluation, and public-safe packaging path. The next model-quality milestone is a structured-output improvement pass that raises strict-JSON validity and improves action/subtask metrics.
Episode Requirement
A valid training episode needs annotation.hdf5 and at least
fisheye_cam0.mp4. A complete omni-model episode preferably includes all six
MP4 streams. visualization.rrd is a viewer artifact and is excluded from
training downloads.
Any future larger pilot should only be reported after:
- at least 32 valid episodes are prepared,
- train/test splits are separated by episode,
- manifest files record missing views and feature coverage,
- training finishes with metadata and progress logs,
- evaluation runs on held-out test episodes,
- predictions, metrics, confusion matrices, and a run report are committed.
The current verified 128-episode diagnostic package satisfies the reporting path above for the exported subset and records skipped selected episodes in the dataset manifest.
Discovery Snapshot
| Source | Valid episodes available to the current public project state |
|---|---|
| Local public sample | 1 |
| ModelScope discovery | 0 |
| Hugging Face gated metadata audit | 12,102 complete visible episodes |
The Hugging Face count is a metadata-only availability result. It does not mean that the raw files have been downloaded, staged, or used for multi-episode training yet.
Related Files
results/omni_finetune/MULTI_EPISODE_ACCESS_STATUS.mdresults/omni_finetune/FULL_DATASET_METADATA_AUDIT.mdresults/omni_finetune/full_dataset_metadata_audit.jsonresults/omni_finetune/XPERIENCE10M_128_EPISODE_SELECTION.mdresults/omni_finetune/XPERIENCE10M_128_DATA_PREPARATION_AND_FINETUNE_PLAN.mddocs/data/omni_finetune_verified_result.jsonresults/omni_finetune/verified_public/results/omni_finetune/xperience10m_128_episode_selection.jsonresults/omni_finetune/xperience10m_128_episode_download_files.txtresults/omni_finetune/source_discovery.jsonscripts/omni/discover_xperience10m_sources.pyscripts/omni/analyze_xperience10m_hf_metadata.pyscripts/omni/select_xperience10m_pilot_episodes.pyscripts/omni/relay_xperience10m_selection.pyscripts/omni/parallel_chunk_transfer.pyscripts/omni/audit_staged_xperience10m_content.pyscripts/omni/build_episode_manifest.py