Proactive Assistant

Proactive Assistant is a FastAPI application that predicts likely ride and food actions before the user explicitly requests them. It combines:

Persistent behavioral memory in SQLite
Authenticated browser automation with Playwright
Deterministic trigger evaluation running in the background
Live data hydration for Uber and Swiggy
A single-page frontend for confirmation, feedback, and inspection

The system is designed around two product surfaces:

Ride Assistant: predicts likely next rides, validates them against live pricing and ETA data, and exposes a one-tap confirmation flow
Food Assistant: predicts likely next orders, checks live restaurant availability and delivery timing, and presents alternatives when the preferred option is weak or unavailable

Local Setup

Prerequisites

Python 3.10 or newer
Chromium installed through Playwright
Network access for third-party pages and public APIs

Install

cd proactive_assistant
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
playwright install chromium

Environment variables

Create .env in the project root:

GROQ_API_KEY=
GROQ_MODEL=openai/gpt-oss-120b
GEOCODING_API=
PORT=8000

Notes:

GROQ_API_KEY is optional. Without it, the app falls back to deterministic explanations and ranking.
GEOCODING_API is used for geocoding support.
PORT is currently not consumed by run.py; local dev starts on 8001 unless you run uvicorn manually.

Run

Using the provided entry point:

python run.py

This starts the server on:

http://localhost:8001

Equivalent uvicorn invocation:

uvicorn proactive_assistant_app.app:app --host 0.0.0.0 --port 8001 --reload

First-run sequence

Open http://localhost:8001/login
Authenticate into the application shell
Connect Uber from the rides flow
Connect Swiggy from the food flow
Run history sync
Let pattern extraction populate behavioral memory
Reload the main UI and inspect /rides, /food, and /api/trigger/log

Walkthrough

Loom: https://www.loom.com/share/e2c98321cf0f48b69455ce2a091ebe58

What the System Actually Does

Account connection

The application does not ask the user to export cookies or manually inject credentials. Instead, account connection is performed inside a remote Playwright browser session:

Uber login is opened in a mobile-emulated Chromium context
Swiggy login is opened in a desktop Chromium context
The frontend browser viewer streams screenshots from the headless page
User clicks and keystrokes are relayed back into Playwright over API calls
Once login is detected, the authenticated session is persisted locally and reused for subsequent scraping

Uber persists cookies to proactive_assistant_app/browser_session/cookies.json.

Swiggy persists Playwright storage state to proactive_assistant_app/browser_session/swiggy_storage_state.json.

Background prediction loop

At application startup:

SQLite schema initialization runs
Pattern extraction is launched in a background thread
A persistent trigger watcher starts

The trigger watcher evaluates patterns every 60 seconds and decides whether to emit a proactive suggestion.

Prediction mechanics

The current implementation is primarily deterministic:

ride patterns are derived from historical departure windows and destination clusters
food patterns are derived from historical order windows, cuisine frequency, and restaurant preference scores
live data is fetched only after the system has a candidate worth showing

Groq is optional and is used in two narrow places:

concise natural-language reason generation
ranking current ride candidates when an API key is configured

If no Groq key is provided, the system remains functional and falls back to deterministic ranking.

Architecture

Backend

FastAPI application: proactive_assistant_app/app.py
Entry point: run.py
Default local dev port: 8001

Persistence

SQLite database: proactive_assistant_app/assistant.db
WAL mode enabled
Stores raw history, extracted patterns, trigger logs, suggestions, feedback, scraper cache, scraper snapshots, platform connection metadata, and app settings

Core modules

app.py: API surface, startup lifecycle, authentication cookie handling, live context aggregation, page serving
database.py: persistence layer and behavioral memory primitives
pattern_engine.py: extraction of ride and food patterns plus feedback reweighting
trigger_watcher.py: background loop that decides when to surface suggestions
suggestion_builder.py: converts triggers into actionable suggestions and hydrates them with live data
uber_client.py: Uber Playwright login, history scraping, live estimate scraping, deeplink generation
food_client.py: Swiggy Playwright login, order-history scraping, top-restaurant scraping, live restaurant matching
data_fetcher.py: timeouts, fallback execution, cache reuse, stale-data handling, scraper health tracking
suggestion_engine.py: legacy / auxiliary Groq-backed ride ranking path still used by current ride suggestion APIs
reason_generator.py: optional one-line explanation generation with deterministic fallback

Frontend

Static SPA served from proactive_assistant_app/static/
Main files:
- index.html
- app.js
- styles.css
- login.html

The frontend is route-driven and exposes:

/
/rides
/food

Data Model

The SQLite schema is not just a storage layer for scraped history. It is the system's memory.

Raw history tables

rides
orders

These store normalized Uber ride history and Swiggy order history, including timestamps, pricing, coordinates, labels, and metadata required for downstream inference.

Learned pattern tables

departure_patterns
destination_clusters
order_patterns
restaurant_patterns
cuisine_by_day

These tables represent the learned behavioral model used by the trigger engine.

Triggering and feedback tables

trigger_log
suggestions
dismissed_suggestions
confirmed_suggestions
pattern_feedback

These tables allow the system to:

know when it last triggered
know what it showed
know whether the user confirmed, dismissed, edited, or ignored the suggestion
suppress patterns that are repeatedly rejected

Runtime resilience tables

scraper_cache
scraper_snapshots
platform_connections
app_settings

These support session persistence, stale fallback data, scraper health reporting, and connection state.

Trigger Logic

Trigger evaluation lives in trigger_watcher.py.

Poll frequency

The watcher runs every 60 seconds

Ride triggers

Ride suggestions are considered when:

the current time is within a 20-minute tolerance of a learned departure window
the pattern confidence passes threshold
the user has not already booked a ride in that same window today
the pattern is not suppressed due to repeated dismissals

The watcher also compares live route travel time to historical travel time:

if live travel time exceeds historical average by 25 percent, the trigger is strengthened
this adds traffic_deviation as a trigger reason
it can shift the recommended departure earlier

Ride annoyance controls:

base cooldown: 45 minutes
confidence threshold increases after repeated dismissals
4 dismissals in the recent window suppress the pattern
pending suggestions expire after 10 minutes and count as ignored feedback

Food triggers

Food suggestions are considered when:

the current time is within a 15-minute tolerance of a learned ordering window
the pattern confidence passes threshold
the user has not already ordered from that restaurant in the same window today
the pattern is not suppressed due to dismissals

The watcher also compares current delivery ETA to historical delivery time:

if current ETA exceeds historical ETA by 30 percent, the trigger is strengthened
this adds delivery_delay as a trigger reason

Food annoyance controls:

base cooldown: 30 minutes
repeated dismissals increase the threshold
4 dismissals suppress the pattern

Pattern Extraction

Pattern extraction lives in pattern_engine.py and runs:

at startup
after ride history sync
after food history sync

Ride pattern extraction

The engine:

clusters destinations by label similarity and geographic proximity
converts departure times into 15-minute bins
groups rides by weekday, hour bin, destination cluster, platform, and ride type
stores only patterns with enough support to be meaningful

Food pattern extraction

The engine:

groups orders by weekday and 15-minute order windows
tracks cuisine frequency per day
builds restaurant preference scores using frequency and recency
stores recent, high-signal restaurant preferences for live matching

Feedback reweighting

When the user confirms or dismisses a suggestion, the engine updates pattern state:

confirmations increase frequency and refresh recency
dismissals increase dismissal count
repeated dismissals suppress the pattern
confirmed edits can create new pattern variants

Live Data Hydration and Fallback Strategy

The system does not trust scraped live data as always available.

General fetch strategy

data_fetcher.py wraps critical live data calls with:

an 8-second primary timeout
a 5-second fallback timeout
scraper cache writes
stale-cache reuse
scraper health tracking

If both primary and fallback fail:

the most recent cached payload is returned as a CachedResult if available
otherwise the result becomes DataUnavailable

Uber live data

For rides, the system fetches:

live estimates from m.uber.com
network responses containing fare and product data
DOM candidates as a secondary extraction channel

If precise live data is missing, the suggestion still survives with:

historical destination confidence
deeplink generation
explicit fallback messaging

Swiggy live data

For food, the system fetches:

top restaurants from Swiggy internal listing endpoints
live restaurant matching for the preferred restaurant and item
delivery time and deeplink data

If the preferred restaurant is weak or unavailable:

the builder switches to historical restaurant alternatives

UI-visible fallback behavior

The UI is designed to degrade gracefully:

geolocation falls back to IP lookup
weather, air quality, and reverse geocoding reuse cached values when live lookups fail
food and ride panels render non-blocking placeholders while background fetches resolve
cached or stale scraper results are still surfaced when they are better than showing nothing

Playwright Integration

Uber

Uber automation in uber_client.py has two distinct modes:

login/browser-viewer mode
scraping/live-quote mode

Login flow:

launches headless Chromium with mobile emulation
streams screenshots to the frontend
relays click, type, and key events from the UI
tracks popup windows for OAuth flows
treats login as complete when redirected away from auth pages or when session cookies such as sid / csid are present

History sync:

opens https://riders.uber.com/trips
captures CSRF tokens from the page's own requests
replays Uber's internal GraphQL Activities query in-page
paginates through nextPageToken
augments records with trip-detail extraction when needed

Live quotes:

opens https://m.uber.com/looking?...
listens to fare/product network responses
parses DOM price cards as a secondary extraction channel

Swiggy

Swiggy automation in food_client.py also has two modes:

login/browser-viewer mode
authenticated same-origin data fetch mode

Login flow:

launches headless Chromium in desktop mode
persists Playwright storage_state
verifies login by checking account URLs or account/profile DOM markers

History sync:

reuses the authenticated browser context
loads the orders page
executes same-origin fetch requests inside the page for /dapi/order/all
paginates order history
falls back to DOM scraping if the internal API path fails

Restaurant discovery:

fetches /dapi/restaurants/list/v5 from within the authenticated page context
falls back to homepage DOM scraping if the endpoint fails

Authentication

The app uses a simple application-level auth gate for access to the UI:

manual login via /api/auth/login
optional Google Sign-In via /api/auth/google
session stored in an HTTP-only cookie

Google Sign-In is optional. If GOOGLE_CLIENT_ID is not configured, the rest of the application still works.

This auth layer controls access to the SPA. Platform-specific authentication for Uber and Swiggy is separate and managed via Playwright sessions.

API Overview

UI and auth

GET /login
GET /
GET /rides
GET /food
GET /api/auth/status
POST /api/auth/login
GET /api/auth/google/config
POST /api/auth/google
POST /api/auth/logout

Live context

GET /api/context/live
GET /api/geocode

Uber

GET /api/uber/status
POST /api/uber/login
GET /api/uber/screenshot
POST /api/uber/click
POST /api/uber/type
POST /api/uber/key
POST /api/uber/finish-login
POST /api/uber/sync-history
GET /api/uber/history

Food

GET /api/food/status
POST /api/food/login/{provider}
GET /api/food/screenshot
POST /api/food/click
POST /api/food/type
POST /api/food/key
POST /api/food/finish-login/{provider}
POST /api/food/sync-history
GET /api/food/history
GET /api/food/top-restaurants

Patterns and predictions

GET /api/history/sync
GET /api/patterns/summary
GET /api/ride/patterns
GET /api/food/patterns
GET /api/suggestions/current
GET /api/ride/suggestion
GET /api/food/suggestion
POST /api/suggestions/{suggestion_id}/confirm
POST /api/suggestions/{suggestion_id}/dismiss
POST /api/ride/confirm
POST /api/ride/dismiss
POST /api/food/confirm
POST /api/food/dismiss
GET /api/trigger/log

Health and realtime

GET /api/health
GET /health
WS /ws/suggestions

Operating Notes

Session artifacts

Playwright session files are stored under:

proactive_assistant_app/browser_session/

These files contain authenticated browser state and should be treated as sensitive local credentials.

Local database

SQLite database path:

proactive_assistant_app/assistant.db

To reset local state, remove the database and browser session artifacts, or use the provided cleanup script:

./clear_data.sh

Debug output

Uber history scrape debug output is written to:

proactive_assistant_app/scrape_debug/uber_history_last.json

Health inspection

Use:

/health
/api/health
/api/trigger/log
/api/patterns/summary

to inspect scraper health, trigger behavior, and extracted memory state.

Testing

The repository includes targeted tests under tests/, for example:

tests/test_live_uber_estimates.py
tests/test_suggestion_builder_geocoding.py
tests/test_ola_client.py
tests/test_uber_history_quality.py

Run them with:

pytest

Current Constraints

The application is optimized for local single-user operation
Browser automation depends on third-party site structure and internal endpoints
Some environment variables documented in .env are optional rather than mandatory
run.py pins development startup to port 8001
The UI auth cookie is application-level only; it is not a substitute for Uber or Swiggy authentication

Summary

This codebase is not a static demo. It is a stateful local system with:

authenticated platform sessions
persistent behavioral memory
background trigger evaluation
live data hydration
explicit fallback and cache reuse
feedback-driven suppression and reweighting

That combination is the core of the product: not just prediction, but prediction that remains operational under unreliable external integrations.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
proactive_assistant_app		proactive_assistant_app
proactive_assistant_docs		proactive_assistant_docs
tests		tests
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
clear_data.sh		clear_data.sh
requirements.txt		requirements.txt
research_uber_api.py		research_uber_api.py
run.py		run.py

Folders and files

Latest commit

History

Repository files navigation

Proactive Assistant

Local Setup

Prerequisites

Install

Environment variables

Run

First-run sequence

Walkthrough

What the System Actually Does

Account connection

Background prediction loop

Prediction mechanics

Architecture

Backend

Persistence

Core modules

Frontend

Data Model

Raw history tables

Learned pattern tables

Triggering and feedback tables

Runtime resilience tables

Trigger Logic

Poll frequency

Ride triggers

Food triggers

Pattern Extraction

Ride pattern extraction

Food pattern extraction

Feedback reweighting

Live Data Hydration and Fallback Strategy

General fetch strategy

Uber live data

Swiggy live data

UI-visible fallback behavior

Playwright Integration

Uber

Swiggy

Authentication

API Overview

UI and auth

Live context

Uber

Food

Patterns and predictions

Health and realtime

Operating Notes

Session artifacts

Local database

Debug output

Health inspection

Testing

Current Constraints

Summary

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages