Websight

Vision first browser agents based on Websight-7B, a custom 7B parameter model.

Installation

pip install websight
# or
uv add websight

Quickstart

Call the model directly on an image:

from websight import websight_call

action = websight_call(
    prompt="Click the Login button",
    history=[],  # prior (reasoning, action) pairs if you have them
    image_base64="data:image/png;base64,<...>",
)
print(action.action)  # e.g., "click"
print(action.args)    # e.g., {"x": "175", "y": "514"}

Reference

websight.websight_call

def websight_call(
    prompt: str,
    history: list[tuple[str, str]],
    image_base64: str,
    console: rich.console.Console | None = None,
    max_new_tokens: int = 1000,
) -> Action

Calls the Websight VLM with a screenshot and instruction, returning a structured Action.

websight.Action

class Action(BaseModel):
    action: str                # e.g. "click", "drag", "type", "scroll", ...
    args: dict[str, str]       # e.g. {"x": "175", "y": "514"}
    reasoning: str             # model rationale

websight.Agent

from websight.agent import Agent

agent = Agent(show_browser=False)
result = agent.run("Open https://example.com and search for 'websight'", max_iterations=10)

Basic Agent loop using Playwright: takes a screenshot, calls websight_call, parses and executes the predicted action, and repeats until it sees finished(...).

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
eval		eval
eval_results		eval_results
scripts		scripts
src/websight		src/websight
tests		tests
websight-7B		websight-7B
.gitattributes		.gitattributes
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock
websight.py		websight.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

Websight

Installation

Quickstart

Reference

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

Uh oh!

Uh oh!

SuperAce100/websight

Folders and files

Latest commit

History

Repository files navigation

Websight

Installation

Quickstart

Reference

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages