Develop `Context` class #34

ubdbra001 · 2025-09-08T15:26:31Z

closes #32 (eventually)

I'm happy to leave this open and just add to it as I approach the rest of the methods, but if you'd like to open a PR for each of the rest of the methods then I'm also happy to defer to you on that.

Here's my initial attempt at this:

- I've developed the low-hanging fruit for Context (path, entities, datatype, suffix, extension, modality, and size) and added stubs for the rest.
- I've also added tests for the methods developed.

One thing I wasn't sure about is where it would be beneficial to use cached_property over property for this class (and I suppose more generally). Any advice?
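For reference, a minimal sketch of the trade-off behind the cached_property question. The `Example` class and its attributes are hypothetical stand-ins, not the real Context class; the assumption is that cheap, pure computations can stay as `property`, while anything that does I/O or parsing is a candidate for `cached_property`, provided the underlying file does not change during the object's lifetime.

```python
from functools import cached_property


class Example:
    """Hypothetical stand-in, not the real Context class."""

    def __init__(self, path: str):
        self.path = path
        self.loads = 0  # counts how often the expensive step runs

    @property
    def suffix(self) -> str:
        # Cheap string work: recomputed on every access, no cache needed.
        stem = self.path.rsplit('/', 1)[-1].split('.', 1)[0]
        return stem.rsplit('_', 1)[-1]

    @cached_property
    def size(self) -> int:
        # Pretend this touches the filesystem; the body runs once per instance.
        self.loads += 1
        return len(self.path)


ex = Example('sub-01/func/sub-01_task-rest_bold.nii.gz')
first, second = ex.size, ex.size  # second access is served from the cache
```

Note that `cached_property` stores the result in the instance `__dict__`, so it does not work on classes that define `__slots__` without a `__dict__` slot.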

My next step will be to develop the subjects method along with the Sessions class. I think I'll start by creating tests for these (so you can make sure I understand how these are meant to work) and then go from there.

codecov · 2025-09-08T15:27:36Z

Codecov Report

❌ Patch coverage is 92.01878% with 17 lines in your changes missing coverage. Please review.
✅ Project coverage is 90.95%. Comparing base (51450b3) to head (88ce2e5).
⚠️ Report is 38 commits behind head on main.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| src/bids_validator/context.py | 85.43% | 12 Missing and 3 partials ⚠️ |
| src/bids_validator/types/files.py | 89.47% | 1 Missing and 1 partial ⚠️ |

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #34      +/-   ##
==========================================
+ Coverage   89.68%   90.95%   +1.27%     
==========================================
  Files          12       12              
  Lines         630      774     +144     
  Branches      104       87      -17     
==========================================
+ Hits          565      704     +139     
- Misses         42       46       +4     
- Partials       23       24       +1

effigies

Cool. A couple quick notes.

ubdbra001 · 2025-09-09T13:10:22Z

I just added the tests for the subject method and Sessions class. Do they make sense? Am I missing anything?

effigies · 2025-09-09T13:38:19Z

Makes sense to me. I would just skip sessions.phenotype. That's been removed in the latest version of BIDS, since it was never used.

ubdbra001 · 2025-09-09T15:11:09Z

Started playing around with this and I'm using the walk_back function to work backwards from the specific file in Context to get the tree for the subject entity.

Does this make sense?
If yes, the walk_back / _walk_back function throws an error because FileParts now expects schema to be passed (Line 249). Shall I just update this so that schema is passed to walk_back? Or do you have a better suggestion?

effigies · 2025-09-09T18:09:53Z

I would actually probably create subject once when entering a subject directory, and then pass Context(dataset, subject, file) on each file inside it.

def walk(directory, dataset, subject=None):
    if subject is None and is_subject_dir(directory):
        subject = Subject(directory, dataset)

    for child in directory.children:
        yield Context(child, dataset, subject)
        if child.is_dir():
            yield from walk(child, dataset, subject)

tree = FileTree.read_from_filesystem(root)
dataset = Dataset(root, schema)
for context in walk(tree, dataset):
    ...
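To see how the subject gets created once and shared, here is a self-contained variant of the walk above that runs without the real classes. `Node`, `is_subject_dir`, the dict used as a subject, and the tuple used in place of `Context` are all hypothetical stand-ins chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical in-memory stand-in for a FileTree node."""
    name: str
    children: list['Node'] = field(default_factory=list)

    def is_dir(self) -> bool:
        return bool(self.children)


def is_subject_dir(node: Node) -> bool:
    # Assumed rule: subject directories are named sub-<label>.
    return node.is_dir() and node.name.startswith('sub-')


def walk(directory, dataset, subject=None):
    # Create the subject once, on entering its directory.
    if subject is None and is_subject_dir(directory):
        subject = {'dir': directory.name}
    for child in directory.children:
        # Stand-in for Context(child, dataset, subject).
        yield (child.name, dataset, subject)
        if child.is_dir():
            yield from walk(child, dataset, subject)


tree = Node('root', [
    Node('dataset_description.json'),
    Node('sub-01', [Node('anat', [Node('sub-01_T1w.nii.gz')])]),
])
contexts = list(walk(tree, 'ds'))
```

With this shape, every entry below `sub-01` receives the same subject object, while the entry for the `sub-01` directory itself is yielded before the subject exists.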

ubdbra001 · 2025-09-11T12:55:16Z

Right, that does make sense. This should be in __main__.py where we're starting walking through the FileTree from the root directory?
Is this the only part of the context that would benefit from being created earlier in the walk and then passed to the Context object?
Looking at the list in #32 I don't think so, but I thought it would be worth double checking.

ubdbra001 · 2025-09-11T12:59:08Z

Also, am I right in thinking that we'll still be using the walk_back functions for the sidecar? And if so we'll still need to fix up those functions to pass schema on?

effigies · 2025-09-11T13:48:57Z

> This should be in __main__.py where we're starting walking through the FileTree from the root directory?

Yes, or wherever we do the walk.

> Is this the only part of the context that would benefit from being created earlier in the walk and then passed to the Context object?

Yes, Dataset and Subject are the only two.

> Also, am I right in thinking that we'll still be using the walk_back functions for the sidecar?

Yes.

> And if so we'll still need to fix up those functions to pass schema on?

Seems excessive. What do you think about making schema optional and just leaving datatype as None if schema is not passed?
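A minimal sketch of what an optional schema could look like. `Parts`, `from_path`, and the `{'datatypes': [...]}` schema layout are hypothetical simplifications, not the real FileParts or BIDS schema objects; the only point is that callers such as walk_back can omit the schema and get `datatype=None`.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Parts:
    """Hypothetical, simplified stand-in for FileParts."""
    path: str
    datatype: Optional[str] = None

    @classmethod
    def from_path(cls, path: str, schema: Optional[dict] = None) -> 'Parts':
        datatype = None
        if schema is not None:
            # Assumed layout: the schema lists known datatype directory names.
            parent = path.rsplit('/', 2)[-2] if '/' in path else ''
            if parent in schema.get('datatypes', ()):
                datatype = parent
        return cls(path, datatype)


with_schema = Parts.from_path(
    'sub-01/anat/sub-01_T1w.json', {'datatypes': ['anat', 'func']}
)
without_schema = Parts.from_path('sub-01/anat/sub-01_T1w.json')
```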

effigies

A couple notes.

ubdbra001 · 2025-09-24T17:48:00Z

I ended up using a load_sidecar function because it made for a simpler test to check that inheritance is working properly.
I've started testing more files to capture the different behaviours (e.g. a file that doesn't have a sidecar). It seems a bit messy to me, so any guidance on how best to tidy it would be welcome.
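As a reference point for what an inheritance test can assert, here is a minimal sketch. It assumes the behaviour under test is "the sidecar closest to the data file wins", and `merge_sidecars` is a hypothetical helper, not the load_sidecar function from this PR.

```python
def merge_sidecars(sidecars):
    """Merge metadata dicts ordered from dataset root to the file's own directory."""
    merged = {}
    for metadata in sidecars:
        merged.update(metadata)  # later (closer) values override earlier ones
    return merged


root_level = {'RepetitionTime': 2, 'TaskName': 'rest'}
subject_level = {'RepetitionTime': 1.5}
merged = merge_sidecars([root_level, subject_level])
no_sidecar = merge_sidecars([])  # a file without any sidecar yields empty metadata
```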

effigies · 2025-09-25T16:39:49Z

We've tied ourselves pretty tightly to the filesystem here, which makes mocking up objects difficult, as well as anything where we won't have access to directory entries, like datasets in the cloud.

I think it probably makes sense to wrap a Path | UPath instead of DirEntry, and limit ourselves to the Path API.

I think we could then create mock files for testing by subclassing UPath to provide things like read_bytes() and is_dir().

class TestPath(UPath):
    def __init__(self, path, contents):
        # Sketch only: a real subclass would also need to hand `path` to UPath.
        self._contents = contents

    def read_bytes(self):
        return self._contents

Or possibly we end up with a TestTree that generates Path-like objects on demand.

tree = TestTree({
  '/bold.json': json.dumps({'RepetitionTime': 2}).encode(),
  ...
})
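One way the on-demand idea could look, sketched with the standard library only. `MemoryPath` is a hypothetical name and covers just `name`, `is_dir()`, `read_bytes()`, and `iterdir()`; it is not a UPath subclass and not the implementation that ended up in this PR.

```python
import json
from pathlib import PurePosixPath


class MemoryPath:
    """Hypothetical Path-like object backed by a dict of file contents."""

    def __init__(self, path, files):
        self._path = PurePosixPath(path)
        self._files = files  # maps absolute POSIX path -> bytes

    @property
    def name(self):
        return self._path.name

    def is_dir(self):
        # A directory is any path that is a strict ancestor of some file.
        return any(self._path in PurePosixPath(f).parents for f in self._files)

    def read_bytes(self):
        return self._files[str(self._path)]

    def iterdir(self):
        # Yield each direct child once, in sorted order.
        children = set()
        for f in self._files:
            p = PurePosixPath(f)
            if self._path in p.parents:
                children.add(p.relative_to(self._path).parts[0])
        for child in sorted(children):
            yield MemoryPath(self._path / child, self._files)


files = {
    '/bold.json': json.dumps({'RepetitionTime': 2}).encode(),
    '/sub-01/func/sub-01_bold.json': b'{}',
}
root = MemoryPath('/', files)
names = [p.name for p in root.iterdir()]
```

Directories are never stored explicitly here; they are inferred from the file paths, which keeps test fixtures down to a single dict.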

effigies · 2025-09-29T15:55:16Z

Resetting to draft, pending columns implementation.

effigies · 2025-09-30T01:29:58Z

Made a PR against your PR: ubdbra001#1

Can factor it out, if you prefer. It's mostly orthogonal.

ubdbra001 · 2025-10-13T14:48:30Z

I've added the code for the columns property. I also added some tests, but (1) these still use the file system, and (2), relatedly, the contents of the tar.gz files are quite large, and I didn't get a chance to think about how to deal with this.
Addressing 1 and creating some smaller test files in memory would address 2, but I think that may be worth doing separately as an overhaul of the current context.py test file.
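A sketch of how small in-memory fixtures could replace the large archives. `load_tsv_gz_columns` is a hypothetical helper, not the load_tsv_gz function added in this PR, and it assumes the compressed file has no header row, so column names are supplied separately (for example from the sidecar).

```python
import csv
import gzip
import io


def load_tsv_gz_columns(raw: bytes, headers: list[str]) -> dict[str, list[str]]:
    """Hypothetical helper: decompress bytes and group values by column."""
    # gzip.decompress returns bytes, so decode before parsing as text.
    text = gzip.decompress(raw).decode('utf-8')
    reader = csv.reader(io.StringIO(text), delimiter='\t')
    columns = {name: [] for name in headers}
    for row in reader:
        for name, value in zip(headers, row):
            columns[name].append(value)
    return columns


# Build a tiny gzipped TSV in memory instead of reading one from disk.
raw = gzip.compress(b'0.1\t1.5\n0.2\t1.7\n')
columns = load_tsv_gz_columns(raw, ['cardiac', 'respiratory'])
```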

ubdbra001 added 2 commits September 8, 2025 16:17

added intial tests for Context class

1a9dfdb

Context class WIP

2db6ba9

ubdbra001 changed the title ~~I32 develop context class~~ Develop Context class Sep 8, 2025

effigies reviewed Sep 8, 2025

ubdbra001 added 2 commits September 8, 2025 23:39

PR feedback changes

5cc92f5

add missing docstrings

0d427f4

ubdbra001 force-pushed the i32-develop-context-class branch from 7c10dda to 0d427f4 Compare September 8, 2025 22:55

add and use global mapping for datatype to modality

72dbbeb

ubdbra001 force-pushed the i32-develop-context-class branch from 9185ba7 to 72dbbeb Compare September 8, 2025 23:12

add more tests for next stage

eeb692f

for Sessions class and associated tests for Context class

ubdbra001 added 9 commits September 24, 2025 12:43

Merge branch 'main' into i32-develop-context-class

3d70d8f

remove sessions.phenotype test

7200e99

add sessions class

729bf51

add subject as an input param for Context class

96dd2dc

Update and validate function

1513ff8

Include schema as input param and instansiate dataset object

Update walk function

685b978

now takes dataset as input param, generates subject object if required, and yields context object

update tests

d2d9975

tidy for ruff

441dd49

add schema_path cli option

fa203d1

effigies reviewed Sep 24, 2025

ubdbra001 added 2 commits September 24, 2025 15:12

Update validate function

3aa3fec

walk now returns context object which needs to be handled differently

feedback changes

fbbe129

ubdbra001 added 6 commits September 24, 2025 17:52

add sidecar method to context class

ecc0f37

use orjson over built-in json package

0343be1

refine tests for sidecar

162a7c9

add test for json property

fe42bb7

ruffed test_context

adb4b75

add json property to context class

85f7733

ubdbra001 force-pushed the i32-develop-context-class branch from c1c8bc2 to 85f7733 Compare September 24, 2025 17:43

effigies approved these changes Sep 29, 2025

effigies marked this pull request as ready for review September 29, 2025 15:22

effigies marked this pull request as draft September 29, 2025 15:54

ubdbra001 and others added 5 commits September 29, 2025 16:56

feedback changes

6a90acc

update test for context.sidecar

37df90a

update typing for main

1360680

rf: Replace direntry with UPath in FileTree

e68bc1d

test: Generate dataset in-memory, test sidecar overrides

0d31404

ubdbra001 added 4 commits October 8, 2025 10:19

Merge pull request #1 from effigies/rf/upath

2fc29ed

rf: Replace direntry with UPath in FileTree

add load_tsv_gz function

85c250a

add columns method for context class

67e6751

added tests for tsv and tsv.gz loaders

4974176

effigies reviewed Oct 13, 2025

ubdbra001 and others added 2 commits October 13, 2025 16:04

Commit PR feedback

7aeb04e

Co-authored-by: Chris Markiewicz <[email protected]>

add decode for gzip binary object

88ce2e5

effigies marked this pull request as ready for review October 13, 2025 17:54

effigies merged commit 95a021a into bids-standard:main Oct 13, 2025
24 of 25 checks passed

ubdbra001 deleted the i32-develop-context-class branch October 16, 2025 08:56
