Conversation
Make it easier to initialize an experiment's dataset reference without calling `initDataset`
ibolmo
left a comment
I think it's just OTel that needs a follow-up, but check my other comments...
```python
if expected is not None:
    score = 1.0 if output == expected else 0.0

if trace:
```
As a user, why/when would `trace` ever be None?
```python
    :return: A function that can be used as a task or scorer.
    """
    # Disable span cache since remote function spans won't be in the local cache
    _internal_get_global_state().span_cache.disable()
```
This may be too indirect. If the remote function spans won't be in the cache, what's the big deal in checking it anyway?
```python
assert state.span_cache.disabled is False

# Call init_function
f = init_function("test-project", "test-function")
```
It is a bit odd that loading a function causes a cache to be disabled.
```python
root_span.log(output=output, metadata=metadata, tags=tags)

# Create trace object for scorers
from braintrust.trace import LocalTrace
```
I don't think we have a circular dependency here, so I'd move this import to the top of the file.
| """ | ||
| self._otel_flush_callback = callback | ||
|
|
||
| async def flush_otel(self) -> None: |
Where is this called? You probably need to find the right spot to call this and/or `register_otel_flush`.
Aside: we solved this well with async flush. It seems like we're in a similar pattern here.
One option is to insert quickly in memory, and when you need to flush, block to flush.
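A minimal sketch of that pattern, assuming nothing about the PR's actual types: records land in an in-memory list instantly, and only `flush` blocks on the exporter. The `OtelBuffer` class, `add`, and `flush` names are hypothetical, not the real API.

```python
import asyncio

class OtelBuffer:
    """Sketch: accept records instantly in memory; block only on flush.
    Names here are hypothetical, not the PR's API."""

    def __init__(self):
        self._pending = []           # fast in-memory queue
        self._lock = asyncio.Lock()  # serialize concurrent flushes

    def add(self, record):
        # Cheap append; callers never block on export here.
        self._pending.append(record)

    async def flush(self, export):
        # Drain the buffer and block until the exporter finishes.
        async with self._lock:
            batch, self._pending = self._pending, []
            if batch:
                await export(batch)

async def demo():
    buf = OtelBuffer()
    exported = []

    async def export(batch):
        exported.extend(batch)

    buf.add({"span": 1})
    buf.add({"span": 2})
    await buf.flush(export)
    return exported

print(asyncio.run(demo()))  # [{'span': 1}, {'span': 2}]
```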
```python
if not line:
    continue
try:
    record_dict = json.loads(line)
```
Maybe consider some `startswith` checks, and maybe prepend the `span_id`/`root_id` etc., to avoid too many `json.loads` calls.
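To illustrate the idea: if each cached line were written as `<root_span_id>\t<json>` (a hypothetical format, not what the PR does), a cheap `startswith` check filters lines before paying for `json.loads`:

```python
import json

def iter_matching_records(lines, root_span_id):
    """Sketch: filter lines by a prepended root_span_id prefix so that
    only matching lines are JSON-decoded. Format is hypothetical."""
    prefix = root_span_id + "\t"
    for line in lines:
        if not line or not line.startswith(prefix):
            continue  # skip without parsing JSON
        yield json.loads(line[len(prefix):])

lines = [
    'root-a\t{"span_id": "s1"}',
    'root-b\t{"span_id": "s2"}',
    "",
    'root-a\t{"span_id": "s3"}',
]
print(list(iter_matching_records(lines, "root-a")))
# [{'span_id': 's1'}, {'span_id': 's3'}]
```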
| "root_span_id": self._root_span_id, | ||
| } | ||
|
|
||
| async def get_spans(self, span_type: Optional[list[str]] = None) -> list[SpanData]: |
Give callers the option to disable the cache.
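A minimal sketch of what a cache-bypass flag could look like; the `use_cache` parameter name and the `get_spans_sketch` helper are hypothetical, not the PR's API:

```python
import asyncio

async def get_spans_sketch(fetch, cache, key, use_cache=True):
    """Sketch of a cache-bypass option: `use_cache` (hypothetical name)
    lets callers skip the span cache and always hit the backend."""
    if use_cache and key in cache:
        return cache[key]
    spans = await fetch()
    cache[key] = spans
    return spans

async def demo():
    calls = []

    async def fetch():
        calls.append(1)
        return [{"span_id": "s1"}]

    cache = {}
    await get_spans_sketch(fetch, cache, "r1")                   # miss: fetches
    await get_spans_sketch(fetch, cache, "r1")                   # hit: no fetch
    await get_spans_sketch(fetch, cache, "r1", use_cache=False)  # bypass: fetches
    return len(calls)

print(asyncio.run(demo()))  # 2
```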
```python
    span_parents=self.span_parents,
    span_attributes=serializable_partial_record.get("span_attributes"),
)
self.state.span_cache.queue_write(self.root_span_id, self.span_id, cached_span)
```
There could be a performance problem if we write a lot to the filesystem. It would be nice to queue a batch and do bulk inserts into files, to avoid a filesystem read/write per span.
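A sketch of that batching idea: queue records in memory and append them to the cache file in one write per batch. The `BatchedSpanWriter` class, `queue_write`, `flush`, and `batch_size` are all hypothetical names, and JSONL is an assumed file format.

```python
import json, os, tempfile

class BatchedSpanWriter:
    """Sketch: buffer cache writes and append them to disk in bulk,
    instead of one filesystem write per span. Names are hypothetical."""

    def __init__(self, path, batch_size=100):
        self.path = path
        self.batch_size = batch_size
        self._queue = []

    def queue_write(self, record):
        self._queue.append(record)
        if len(self._queue) >= self.batch_size:
            self.flush()

    def flush(self):
        if not self._queue:
            return
        # One append-mode write for the whole batch.
        with open(self.path, "a") as f:
            f.write("".join(json.dumps(r) + "\n" for r in self._queue))
        self._queue.clear()

path = os.path.join(tempfile.mkdtemp(), "span_cache.jsonl")
writer = BatchedSpanWriter(path, batch_size=3)
for i in range(7):
    writer.queue_write({"span_id": f"s{i}"})
writer.flush()  # flush the remaining partial batch
with open(path) as f:
    print(sum(1 for _ in f))  # 7
```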
```python
SpanFetchFn = Callable[[Optional[list[str]]], Awaitable[list[SpanData]]]


class CachedSpanFetcher:
```
Future: should we consider fetching and saving the data locally?
No description provided.