cupido/tasks/todo.md at b273255dea68d1b1507e05e386cdee452ecc93e6

Giorgio Gilestro 23050360ea Remove data/raw/ entirely — all bulky data now under /mnt/data/projects/cupido/

Deleted the 5 stale pre-pipeline tracking DBs and the data/raw/ directory.
Dropped DATA_RAW from config.py; build_video_inventory now scans
TRACKING_OUTPUT_DIR for already-tracked sessions. Notebooks no longer
import DATA_RAW. README, PLANNING and todo updated to reflect that the
repo holds only code + small curated metadata, never bulky DBs.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

6.9 KiB

Task List

Completed Work

Priority: Bimodal Hypothesis Analysis

Phase 1: Per-ROI Feature Extraction

Phase 2: Distribution Visualization

Phase 3: Formal Bimodality Testing

Phase 4: Subgroup Identification

Phase 5: Effect Size Re-estimation

Maintenance Items

Phase: Offline Tracking of 2024 Video Backlog (added 2026-04-27)

Recap

Plan

Still TODO

Open questions / risks

Discovered During Work

Barrier-opening annotation for the 2024 batch (added 2026-04-30)

Metadata vocabulary normalization (done 2026-04-30)
