cupido/tasks/todo.md at e4da7691d5d83c8161986efef720eb25b6af298d

Giorgio e4da7691d5 Add offline tracking pipeline for video backlog

The 2024 video set in all_video_info_merged.xlsx covers 63 (date, machine)
sessions — 129 video instances — that have no auto-detectable targets, so
ROI placement requires manual reference-point selection. This commit adds
the three-stage pipeline that lets a user click for an hour, then walk
away while the tracker grinds overnight:

  1. build_video_inventory.py — scan /mnt/ethoscope_data/videos/ and join
     against the xlsx, producing data/metadata/video_inventory.csv

  2. pick_targets.py — interactive matplotlib/Tk picker. User clicks
     TOP/CORNER/LEFT (the L-shape ethoscope expects); after the third
     click the 6 ROI rectangles are drawn on top of the frame so geometry
     can be verified before saving. Also supports marking a video
     'unusable' (FOV wrong) so it's permanently skipped, frame stepping
     by ±1s/±5%/midpoint, point editing in --redo mode, and a crosshair
     cursor that survives matplotlib's per-motion cursor reset.

  3. track_videos.py — headless batch tracker. Reads the JSON sidecars,
     builds 6 ROIs from the HD-mating-arena geometry, runs MultiFlyTracker
     against the merged.mp4 via MovieVirtualCamera, writes SQLite DBs to
     data/tracked/. Idempotent (skips done DBs), parallel via --jobs,
     subclasses MovieVirtualCamera so frames stay BGR (MultiFlyTracker
     calls cvtColor(BGR2GRAY) without checking channel count).

Plus auto_detect_targets.py (fallback that runs ethoscope's auto-detector
in case any videos do have visible target dots), monitor_tracking.py
(progress + ETA from data/tracked/ ground truth, --watch for live view),
and tracking_geometry.py (single source of truth for the affine math
shared by picker and tracker).

requirements-tracking.txt pins the extra deps (opencv-python, openpyxl,
gitpython, netifaces, mysql-connector-python) — these are only needed
for the tracking pipeline, not the existing analysis notebooks.

Verified end-to-end on one of the user-picked videos: ~4000 rows/ROI in
a 120s slice, fly bounding boxes in the expected 800-2000 px² band.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

5.6 KiB

Raw Blame History

Task List

Completed Work

Priority: Bimodal Hypothesis Analysis

Phase 1: Per-ROI Feature Extraction

Phase 2: Distribution Visualization

Phase 3: Formal Bimodality Testing

Phase 4: Subgroup Identification

Phase 5: Effect Size Re-estimation

Maintenance Items

Phase: Offline Tracking of 2024 Video Backlog (added 2026-04-27)

Recap

Plan

Still TODO

Open questions / risks

Discovered During Work

5.6 KiB Raw Blame History Unescape Escape

Task List

Completed Work

Priority: Bimodal Hypothesis Analysis

Phase 1: Per-ROI Feature Extraction

Phase 2: Distribution Visualization

Phase 3: Formal Bimodality Testing

Phase 4: Subgroup Identification

Phase 5: Effect Size Re-estimation

Maintenance Items

Phase: Offline Tracking of 2024 Video Backlog (added 2026-04-27)

Recap

Plan

Still TODO

Open questions / risks

Discovered During Work

5.6 KiB

Raw Blame History