read.markets

10 commits 1 branch 0 tags 2.4 MiB

Author	SHA1	Message	Date
Giorgio Gilestro	6e7f57c6b2	phase G: data minimisation + passwordless auth + DeepSeek-first LLM Server no longer holds portfolios. Holdings live in the browser (localStorage); the server publishes an anonymous ticker_universe and a gzipped /api/universe payload identical for every authenticated user, so access patterns can't betray which tickers a user holds. AI commentary is generated ephemerally from the browser-supplied pie and the cost ledger row records no positions. Migrations 0009-0011 added the universe table and dropped positions / portfolio_snapshots / portfolios. Authentication is now e-mail OTP only. Migration 0010 dropped password_hash and email_verified (every active session is by construction proof of email control). The /signup endpoint is gone; signup and login share a single email-entry page. Email rendering is HTML+plain-text multipart with a shared brand palette (app/branding.py) asserted in sync with the CSS by a drift-detection test. LLM provider defaults to DeepSeek-direct (cheaper, api.deepseek.com) with OpenRouter as automatic fallback if DeepSeek fails. ai_log_job and indicator_summary_job now iterate the two tones (NOVICE, INTERMEDIATE) per cycle so the dashboard's tone toggle is instant; PROMPT_VERSION bumped to 6 with an educational anti-TA / anti-gambling stance baked into _CORE. NOVICE mode renders a curated glossary inline (CBOE VIX, yield curve, HY OAS, etc.) with JS-positioned tooltips that survive viewport edges and sticky bars. Model name and tokens hidden from the user UI; still recorded in StrategicLog.model and AICall for admin. Layout adds a sticky top nav, a sticky bottom markets bar (one chip per exchange with status LED + headline index + 1d change), and Phase H feedback reporting is queued in tasks/todo.md. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-18 14:16:57 +01:00
Giorgio Gilestro	480fd311c5	phase A: user accounts + session-cookie auth Replaces the static bearer-token gate with a real auth boundary. The existing CASSANDRA_TOKEN path is retained as an admin / scripting escape hatch — kept compatible by aliasing require_token to require_auth. - New users table (migration 0007): email, argon2 password_hash, tier, email_verified (declared but not enforced until phase E), settings_json for the tone/analysis/anchor knobs we'll wire in phase D. - app/services/auth_service.py: argon2-cffi password hashing with timing- attack-resistant authenticate() (always runs a hash verify even on unknown-email to deny a username-enumeration oracle). - app/auth.py rewritten: require_auth returns a CurrentUser with either is_admin=True (bearer path) or a User object (session path). Failing requests get 303 → /login for HTML, 401 for API. Sessions signed with itsdangerous against CASSANDRA_SESSION_SECRET; 14-day TTL. - app/routers/auth.py: /login, /signup, /logout. Login form preserves the ?next=… param for redirect-after-login. Signup respects a new CASSANDRA_SIGNUP_ENABLED flag. - Standalone /login + /signup templates (no app chrome). base.html grows a user chip + logout link in the header (reads request.state.current_user). Phase A's main known limitations are documented in the plan: email verification is declared but not enforced; session revocation is best-effort (cookie-only, not DB-backed). Both land in phase E. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-16 11:12:10 +01:00
Giorgio Gilestro	8a155ef157	phase B (2/2): CSV upload endpoint + drag-drop UI Completes Phase B. The full alternative-onboarding flow is now end-to-end: drop a T212 pie CSV → parser → InstrumentMap resolver → PortfolioSnapshot + Position rows, all without ever asking the user for broker credentials. - persist_pie() in app/services/csv_import.py: takes a ParsedPie, resolves each Slice via InstrumentMap, writes Portfolio + Snapshot + Position rows. Unmapped slices are still persisted using their CSV values and surfaced in the response for the UI to warn about. - POST /api/portfolios/upload: multipart endpoint accepting CSV file + optional portfolio_name + currency. 2 MiB cap. Returns import summary. - /upload page with drag-drop dropzone, file input fallback, and inline result panel showing invested/value/result + unmapped-slice warnings. - New "Import" link in the header nav. Verified end-to-end against the real T212 export: all 13 positions land with correct T212 tickers (incl. FPp_EQ for the Paris TotalEnergies listing the heuristic resolver picks), zero unmapped slices, totals reconcile to the penny. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-16 11:00:42 +01:00
Giorgio Gilestro	16e9f5f0cc	phase B (1/4): CSV parser + InstrumentMap (T212 shortcode → Yahoo ticker) First two slices of the multi-user roadmap (Phase B). Validates the core onboarding mechanic against the user's real T212 export before paying any auth/tenancy tax. CSV parser (app/services/csv_import.py): - Header-name matched (survives T212 reordering columns between exports), tolerant of UTF-8 BOM, dash/N/A/empty markers, thousand- separator commas, blank rows, zero-quantity stubs, missing Total row. - Returns ParsedPie(name, positions, invested, value, result) with derived avg_price + current_price per share in account currency. - 14 tests covering happy path on the real CSV + 13 edge cases. InstrumentMap (migration 0006 + app/services/instrument_map.py): - Catalogue table mapping T212 ticker → Yahoo ticker, populated by sync_from_t212() against the dev's read-only API key. Manual rows (manual=True) are protected from auto-overwrite. - Pure t212_ticker_to_yahoo() handles both suffix forms: single trailing exchange letter (l/a/p/d/m/s/...) and country code (US, DE, FR, IT, CA, ...). All 13 of the user's holdings + 15 case- coverage tests pass. - Live sync against T212 ingests 17,050 instruments (~2.2% unmappable on exotic exchanges; can extend the suffix map later). - resolve_slice() picks the right listing per shortName using a UK-friendly currency preference (GBX > GBP > EUR > USD). Resolved correctly for all 13 of the user's positions, including TTE on Paris vs the NYSE dual-listing. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-16 10:53:08 +01:00
Giorgio Gilestro	6dac8a2c7f	cadence: support multiple active windows; Asia window commented out Refactored CadencePolicy.active_start_hour/active_end_hour into a tuple of (start, end) hour pairs so additional regional windows can be added without code changes. Default keeps EU/US-only behaviour identical. The Asia window (00:00-08:00 UTC — Tokyo + HK + Shanghai) is included as a commented-out tuple in the dataclass default. Uncomment one line to enable hourly AI cadence during the Asia session as well. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-16 10:20:05 +01:00
Giorgio Gilestro	40cfb50e37	market-aware AI cadence + incremental log updates Two changes that together cut OpenRouter spend ~50% and give the daily log temporal awareness. 1. CadencePolicy (app/services/cadence.py): expensive AI jobs only fire hourly during the EU/US active window (Mon-Fri 07-21 UTC). Off-hours weekdays throttle to every 4h; weekends to every 12h. ai_log_job and indicator_summary_job both consult the policy before doing real work; market/news/portfolio ingest jobs stay hourly (cheap, no API cost). Skipped runs land in job_runs with status 'skipped' and the throttle reason in error. 2. Update mode for ai_log_job: when an earlier log exists for the current UTC day, it's passed to the model as 'Earlier log from today (generated HH:MM UTC)'. The system prompt grows an Update mode section instructing the model to revise — not restart — and anchor on what has CHANGED since the earlier draft. The TL;DR leads with intra-day change when meaningful, the watch list evolves rather than restarts. PROMPT_VERSION bumped to 5. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-16 10:17:39 +01:00
Giorgio Gilestro	2f223b75a3	strip prompt-echo leakage in indicator summaries DeepSeek occasionally regurgitates the system prompt verbatim ("Constraints: ≤60 words...", "Example good: ..."). Three-pronged fix: 1. Removed the inline good/bad example blocks from the per-group and aggregate system prompts — DeepSeek was treating them as templates to copy. The hard constraints alone are clear enough. 2. Expanded the LEAK_PATTERNS list to catch the prompt-label echoes that still occasionally slip through ("Key observations:", "The indicators are:", "Must cite ...", "Should give ...", bare "Key:"). Cleanup now runs up to 6 passes for compound leakage. 3. Added looks_like_leakage() — if the cleaned output still contains tell-tale phrases ("≤60 words", "instructions:", etc.), the summary is skipped rather than persisted. Logs a 'leakage_detected' warning and an ai_calls row with status=leaked so we can see the failure rate over time. The previous good summary stays visible. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-16 10:10:24 +01:00
Giorgio Gilestro	4e7e4981e3	add ECB Data Portal source; group-aware stale thresholds ECB Statistical Data Warehouse joins as a 5th data source — open API, no key, daily euro-area yield curve data. Symbol format 'ECB:dataset/series_key', e.g. 'ECB:YC/B.U2.EUR.4F.G_N_A.SV_C_YM.SR_10Y' for daily 10y AAA spot rate. Bonds tab adds ECB EZ 10y AAA + 2y AAA so there's at least some currently-fresh European sovereign data alongside the US Treasuries. Country-specific yields (Bund/OAT/BTP/Gilt/JGB) remain on Eurostat/FRED monthly mirrors — no free daily source exists for those. Stale threshold is now per-group instead of a flat 90 days. Daily-tape groups (bonds, rates, equity, etc.) flag stale after a week or three; monthly groups (economy, macro, valuation) stay at 60-90 days. The bonds tab will now correctly show 30-60 day-old country yields as stale next to the daily US/ECB ones. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 23:13:58 +01:00
Giorgio Gilestro	1edf9cad41	add Eurostat + UK ONS sources; valuation/bubble/economy/bonds groups; aggregate read; market-open header Three new data sources hooked into the existing SOURCES registry. All open APIs, no keys: - EUROSTAT: prefix EUROSTAT:dataset?dim=val&... — current EU bond yields (Bund/OAT/BTP/EZ) and Eurozone economic indicators that FRED's OECD-mirror series stopped updating in 2022-2023. - ONS: prefix ONS:topic/cdid/dataset — current UK CPI, unemployment, GDP, industrial production. Replaces the 5+ month-stale FRED LRHUTTTTGBM156S mirror. New indicator groups in default.toml feed the strategic/fundamental lens we converged on: valuation (CAPE/Buffett anchors), bubble_watch (SKEW/VVIX/RSP vs SPY/HYG vs TLT/IPO/crypto), economy (multi-region, ALL current-or-stale-flagged), bonds (UK/EU/US/JPN sovereign yields). Indicator panel now opens with an AI "read" interpretation per group (generated hourly at :07 UTC alongside an aggregate cross-group read shown in the dashboard header). The aggregate is grounded by a markets strip — NYSE/LSE/Frankfurt/Tokyo/HK/Shanghai with open/closed LEDs and next-open countdown, computed locally from each exchange's tz. Other UX bits: indicator-row tooltips populated from TOML notes; rows whose last observation is >90 days old get a 'stale' chip; ghost symbols (in DB but no longer in TOML) filtered out of the panel; Eurostat/ONS symbols display as short codes rather than the full API path. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 23:07:42 +01:00
Giorgio Gilestro	a10409c02b	initial commit — cassandra v0.1 Containerised macro-strategy dashboard: 4-panel web UI (indicators, portfolio, flash news, AI strategic log), MariaDB store, hourly ingestion jobs, OpenRouter-backed AI analysis. Ports the four prototype scripts in the parent dir (market_pulse, flash_news, trading212, strategic_log) into async services backed by a persistent DB and served via FastAPI + Jinja2 + HTMX. APScheduler runs as a separate compose service for crash-safety and easier restarts. Portfolio composition + position names come live from Trading 212; news per-ticker headlines reuse those names. Tone (NOVICE/INTERMEDIATE/ PRO) and analysis style (DRY/SPECULATIVE) are env-configurable and stored on each log row so historical entries show what produced them. Default model is deepseek/deepseek-v4-flash (overridable via env). Light/dark theme toggle, sans-serif for prose surfaces, monospace for data. Bearer-token auth, OpenRouter monthly cost cap, RSS feeds auto- disabled on consecutive failures. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 21:56:10 +01:00