Conversation

@tskisner
Member

Hi @zonca and @keskitalo, this PR is for your feedback on API changes that we discussed offline. In addition to looking at the source, I have been updating the tutorials/01_Introduction/intro.ipynb notebook as a "look and feel" example. I have attached a rendered version of the notebook:

intro.pdf

Main features are:

  • Observation class as the new data model, with detdata, shared, intervals, and view as the key attributes through which the contents are accessed and modified.

  • Improved processing model with changes to the Operator class and the new Pipeline operator. These classes are configured with traitlets.

  • New distributed map classes (PixelDistribution and PixelData) which split the calculation of the distribution from the actual data storage. These have the new Alltoallv communication pattern.

Only 2-3 operators have been ported to the new API as a demo. I'll continue with the config file work, which needs updating since the switch to traitlets.
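Since the new Operator classes are configured with traitlets, the pattern looks roughly like the toy operator below. Note this is a hypothetical ScaleOp, not one of the ported toast operators; only the traitlets usage is meant to be representative:

```python
import traitlets

class ScaleOp(traitlets.HasTraits):
    """Hypothetical operator: only the traitlets configuration pattern
    is representative, not the real toast Operator base class."""

    factor = traitlets.Float(2.0, help="Multiplicative factor applied to the data")
    name = traitlets.Unicode("scale", help="Operator name")

    def exec(self, data):
        # Apply the configured factor to a list of samples
        return [x * self.factor for x in data]

op = ScaleOp(factor=3.0)
print(op.exec([1.0, 2.0]))  # [3.0, 6.0]
```

Traits give type-checked, documented configuration: assigning `op.factor = "bad"` raises a TraitError, and the help strings can be harvested for config files.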

@tskisner tskisner requested review from keskitalo and zonca October 15, 2020 20:50
@tskisner tskisner marked this pull request as draft October 15, 2020 20:50
@zonca
Member

zonca commented Oct 15, 2020

I think the easiest way to provide feedback would be to make an export of the notebook to a Python script in a separate pull request, so we can do line by line feedback there. Then the pull request can be closed without merging.

@tskisner
Member Author

Good idea @zonca, will do that soon.

Member Author

Ok @zonca, I enabled the reviewnb plugin on the toast repo. I think you can browse here:

https://app.reviewnb.com//pull/369/files/

and comment at the per-cell level on the intro.ipynb file. Since a lot has changed, there is a switch to "hide previous version". Let me know if that works, since I can't tell whether this plugin is usable by everyone.

@tskisner
Member Author

Note that the output of the notebook is stripped on github, so refer to the PDF attached to this issue to look at that.

@tskisner
Member Author

Updated notebook output, with config section.
intro.pdf

@zonca zonca Oct 16, 2020

mention whether this can be modified in place or is read-only; if it is read-only, how do I modify it?


Member Author

Good point, some options are fixed by the OS/Python runtime environment at startup, but some can be changed after toast is imported. Will give more details.

Member Author

Added text about setting log level manually or through environment. Same with threading.
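The pattern described (a default log level taken from the environment, overridable manually after import) can be sketched with the standard logging module. The TOAST_LOGLEVEL variable name here is illustrative; the actual variable toast uses may differ:

```python
import logging
import os

# Illustrative environment variable name for the default level.
level_name = os.environ.get("TOAST_LOGLEVEL", "INFO")

log = logging.getLogger("toast-demo")
log.setLevel(getattr(logging, level_name.upper(), logging.INFO))

# The level can also be changed manually after import:
log.setLevel(logging.DEBUG)
log.debug("now visible at DEBUG level")
```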

@zonca zonca Oct 16, 2020

better to specify that "traditional CPU systems" means a supercomputer; otherwise it seems that it is also required on a laptop.


Member Author

Well, if the user has an AMD Ryzen workstation with 16 cores (for example), then they probably want to use mpi4py if they are doing something more with toast than just running a notebook with a few samples. I will definitely clarify though. I have started an "intro_parallel.ipynb" where I am going to discuss using IPython.parallel with mpi4py. I'll reference that in the serial notebook.

Member Author

Tried to clarify that toast parallelism is mainly through MPI, so that any system with more than a few cores will benefit from having mpi4py installed.

@zonca zonca Oct 16, 2020

do you plan to implement units here?


Member Author

Good idea. astropy.units support is a new addition to toast and is currently only used in the new Operator traits. I need to go through the codebase systematically and add support.

Member Author

I converted the fake focalplane simulation and plotting functions to use units. However, I'll wait on the rest of the instrument classes until we can revisit the overall plan for those.

@zonca zonca Oct 16, 2020

if detlabels is None, you could use the keys as labels, so we avoid building the trivial dict x:x.

please use keyword arguments for all inputs, so people don't have to look at the help of plot_focalplane

For the color, what about using endswith("A") instead of enumerating?
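The endswith suggestion avoids enumerating detectors by index. With hypothetical detector names (the real names come from the focalplane), it would look like:

```python
# Hypothetical detector names; real names come from the focalplane.
dets = ["D0A", "D0B", "D1A", "D1B"]

# Color by polarization partner using the name suffix,
# instead of enumerating and alternating by index.
colors = {d: ("red" if d.endswith("A") else "blue") for d in dets}
print(colors)  # {'D0A': 'red', 'D0B': 'blue', 'D1A': 'red', 'D1B': 'blue'}
```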


Member Author

Ok, this is cleaned up.

@zonca zonca Oct 16, 2020

I prefer namespacing, what about import toast.config as tc?



@zonca zonca Oct 16, 2020

either it is an attribute:

other_simsat.config

or it is a method, and then it needs to have a verb in the name:

other_simsat.get_config()




Member Author

The methods are now get_config() and get_class_config() 

@zonca zonca Oct 16, 2020

see comment above


Member Author

I was inspired by the traitlets methods traits() and class_traits(), but I can add a "get_" in there if it is clearer.

Member

yes please

@zonca zonca Oct 16, 2020

units are beautiful in the file!



Member

zonca commented Oct 16, 2020

I had heard about it, but this is the first time I have used ReviewNB; it is awesome!

@tskisner, I think the Toast interface looks really good, good job! I have a bit more feedback in the notebook.

@tskisner
Member Author

Thanks @zonca for the detailed review, just what I was looking for! One overall question about objects that act like a dictionary but also carry other state information: for example, obs.detdata stores DetectorData objects, but also information about the observation instance, like the number of samples. So the call to

obs.detdata.create("foo", shape=(5,), dtype=np.int64)

actually creates a DetectorData object under the hood using the number of observation samples. I could also do:

obs.detdata["foo"] = DetectorData(obs.detectors, (obs.n_sample, 5), dtype=np.int64)

And then have setitem check that the number of samples matches that in the observation. This did not seem as convenient to me, but I do hate typing :-)

For the MPI shared memory class, I could replace the set method like this:

# Slice of buffer set by offset and shape of data
obs.shared["foo"].set(data, offset=(3, 6), fromrank=2)

or

# Slice of buffer set by explicit range
obs.shared["foo"][3:7, 6:8] = (data, 2)

or something else?

@zonca
Member

zonca commented Oct 16, 2020

inside __setitem__ you can do:

    def __setitem__(self, key, value):
        # if key is undefined
        data = DetectorData(self.detectors, (self.n_sample,) + value.shape, dtype=value.dtype)
        # set this into the dict

For the set() I don't understand: what do you need the fromrank for?

@tskisner
Member Author

I'll try to clarify... The detdata attribute holds DetectorData objects. So I can certainly modify setitem to do sanity checks on the input value and make sure that the shape is compatible and if so, just assign to the internal dictionary. However, that requires the user to pre-create the DetectorData object first, using information from the observation instance. It seemed like more work for the user this way...

For the MPIShared class, the set() operation is collective: it takes data from one process and broadcasts it into shared memory across multiple nodes. So there is an argument (fromrank) to specify which process has the data. I guess I could figure that out, though, by doing an allreduce first to see which process has a non-None value.
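The allreduce idea can be sketched without MPI: gather one has-data flag per process and require exactly one sender. In mpi4py this would be a comm.allgather of the flag; here a plain list stands in for the gathered values:

```python
def infer_source_rank(per_rank_values):
    """Toy stand-in for the collective: per_rank_values[r] is what rank r
    would contribute (None on every process except the sender)."""
    holders = [rank for rank, val in enumerate(per_rank_values) if val is not None]
    if len(holders) != 1:
        raise ValueError("exactly one process must provide the data")
    return holders[0]

print(infer_source_rank([None, None, [1, 2, 3], None]))  # 2
```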

@tskisner
Member Author

I think I understand now: you want to allow the obs.detdata setitem to take a numpy array and assume that it is the full 2D contents of the underlying DetectorData. So then you could do:

obs.detdata["foo"] = np.zeros((len(obs.detectors), obs.n_sample), dtype=np.int64)

I can implement that, but still not sure it is more convenient. On the other hand, no reason not to support multiple ways of assignment!

@tskisner
Member Author

Ahhh, now I see: you can create the full-size DetectorData object first, and then apply the slicing notation when actually assigning from the RHS.

Ok, I will try this out. I agree this would be a more convenient interface. I'll also try to modify the pshmem package to make the set() method optional at the cost of a precalculation.

@tskisner
Member Author

I have added setitem support to the upstream pshmem package:

https://github.com/tskisner/pshmem/releases/tag/0.2.5

And this new version is available on PyPI:

https://pypi.org/project/pshmem/0.2.5/

So now I can work on using that syntax in toast.

@tskisner
Member Author

Ok, I think I have concluded that the create() methods cannot be replaced in the proposed way. The Observation attributes detdata and shared do not have access to the right-hand-side of the assignment, since that is passed to the DetectorData.__setitem__() method after the manager class __getitem__() is called. To simplify the example, look at this snippet of code:

import numpy as np

class DetData:

    def __init__(self, ndet, shape, dtype):
        self.dtype = np.dtype(dtype)
        self.shape = [ndet]
        self.flatshape = ndet
        for s in shape:
            self.shape.append(s)
            self.flatshape *= s
        self.flatdata = np.zeros(self.flatshape, dtype=self.dtype)
        self.data = self.flatdata.reshape(self.shape)

    def __getitem__(self, key):
        print("DetData getitem {}".format(key))
        return self.data[key]

    def __setitem__(self, key, value):
        print("DetData setitem {}".format(key))
        self.data[key] = value

    def __repr__(self):
        return str(self.data)

class Mgr:
    
    def __init__(self, ndet):
        self.ndet = ndet
        self.d = dict()

    def create(self, name, shape, dtype):
        self.d[name] = DetData(self.ndet, shape, dtype)
        
    def __getitem__(self, key):
        print("Calling Mgr getitem")
        if key not in self.d:
            # Cannot guess what shape and dtype the user wants
            pass
        return self.d[key]

    def __setitem__(self, key, value):
        print("Calling Mgr setitem")
        self.d[key] = value


mgr = Mgr(2)

# This works fine, as expected:

mgr.create("A", (3, 4), np.int32)

mgr["A"][1, 0:2, 0:2] = 5

print("mgr['A'] = \n", mgr["A"])

# This works, but it is annoying, since the user has to know the name
# of the DetData class and also has to get information from the Mgr
# class:

mgr["B"] = DetData(mgr.ndet, (3, 4), np.int32)

mgr["B"][1, 0:2, 0:2] = 5

print("mgr['B'] = \n", mgr["B"])

# The code below is actually doing:
#
# Mgr.__getitem__("C").__setitem__(key, 5)
#
# Which means that the DetData class would have to be instantiated in
# Mgr.__getitem__() where we don't know the correct shape of the buffer
# to create.  Obviously this gives a key error:

mgr["C"][1, 0:2, 0:2] = 5

print("mgr['C'] = \n", mgr["C"])

The output of the above script is:

Calling Mgr getitem
DetData setitem (1, slice(0, 2, None), slice(0, 2, None))
Calling Mgr getitem
mgr['A'] = 
 [[[0 0 0 0]
  [0 0 0 0]
  [0 0 0 0]]

 [[5 5 0 0]
  [5 5 0 0]
  [0 0 0 0]]]
Calling Mgr setitem
Calling Mgr getitem
DetData setitem (1, slice(0, 2, None), slice(0, 2, None))
Calling Mgr getitem
mgr['B'] = 
 [[[0 0 0 0]
  [0 0 0 0]
  [0 0 0 0]]

 [[5 5 0 0]
  [5 5 0 0]
  [0 0 0 0]]]
Calling Mgr getitem
Traceback (most recent call last):
  File "setitem.py", line 75, in <module>
    mgr["C"][1, 0:2, 0:2] = 5
  File "setitem.py", line 40, in __getitem__
    return self.d[key]
KeyError: 'C'
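The evaluation order behind the mgr["C"] failure can be shown with a minimal class: chained subscript assignment calls __getitem__ on the outer object, and __setitem__ only on whatever that returns.

```python
class Recorder:
    """Logs which special methods Python invokes."""

    def __init__(self):
        self.calls = []
        self.inner = {}

    def __getitem__(self, key):
        self.calls.append(("getitem", key))
        # Return an inner dict to receive the item assignment
        return self.inner.setdefault(key, {})

    def __setitem__(self, key, value):
        self.calls.append(("setitem", key))
        self.inner[key] = value

r = Recorder()
r["C"][1] = 5  # expands to r.__getitem__("C").__setitem__(1, 5)
print(r.calls)  # [('getitem', 'C')] -- Recorder.__setitem__ never runs
```

So by the time the right-hand side is known, the outer container has already been asked for the object, which is exactly why it cannot guess the shape and dtype at that point.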

@zonca
Member

zonca commented Oct 17, 2020

you first need to create the thing before slicing it:

mgr = Mgr(2)

mgr["A"] = np.zeros((3,4), dtype=np.int32)

mgr["A"][1, 0:2, 0:2] = 5

It doesn't work now, but it should be implementable.

@tskisner
Member Author

Hi @zonca, thanks for your patience, and sorry if I am being dense :-) Below I updated the toy code to be closer to the real case. The central problem is that when assigning data (see the mgr["C"] case), we don't know which detector and sample range is being specified. If the user assigns data for the full array (see the mgr["D"] case), then we can assign it, but otherwise all we can do is create the buffer and then leave it.

import numpy as np

class DetData:
    def __init__(self, ndet, shape, dtype):
        self.dtype = np.dtype(dtype)
        self.shape = [ndet]
        self.flatshape = ndet
        for s in shape:
            self.shape.append(s)
            self.flatshape *= s
        self.flatdata = np.zeros(self.flatshape, dtype=self.dtype)
        self.data = self.flatdata.reshape(self.shape)

    def __getitem__(self, key):
        return self.data[key]

    def __setitem__(self, key, value):
        self.data[key] = value

    def __repr__(self):
        return str(self.data)


class Mgr:
    def __init__(self, ndet, nsample):
        self.ndet = ndet
        self.nsample = nsample
        self.d = dict()

    def create(self, name, sample_shape, dtype):
        self.d[name] = DetData(self.ndet, (self.nsample,) + sample_shape, dtype)

    def __getitem__(self, key):
        return self.d[key]

    def __setitem__(self, key, value):
        if isinstance(value, DetData):
            self.d[key] = value
        else:
            # This is an array, verify that the number of dimensions match
            sample_shape = None
            if len(value.shape) < 2:
                raise RuntimeError("Assigned array does not have sufficient dimensions")
            elif len(value.shape) == 2:
                # We assume the user meant one scalar value per sample...
                sample_shape = (1,)
            else:
                # The first two dimensions are detector and sample.  The rest of the
                # dimensions are the data shape for every sample and must be fully
                # specified when creating data like this.
                sample_shape = value.shape[2:]
            print(
                "Creating DetData with {} dets, {} samples, {} samp shape".format(
                    self.ndet, self.nsample, sample_shape
                )
            )
            self.d[key] = DetData(
                self.ndet, (self.nsample,) + sample_shape, value.dtype
            )
            # If the value has the full size of the DetData object, then we can do the
            # assignment, if not, then we cannot guess what detector / sample slice
            # the user is trying to assign.
            if (value.shape[0] == self.ndet) and (value.shape[1] == self.nsample):
                # We can do it!
                self.d[key][:] = value


# 2 detectors and 5 samples
mgr = Mgr(2, 5)

# This works fine, as expected:

mgr.create("A", (3, 4), np.int32)
mgr["A"][1, 2:3, 0:2, 0:2] = 5
print("mgr['A'] = \n", mgr["A"])

# This works, but it is annoying, since the user has to know the name
# of the DetData class and also has to get information from the Mgr
# class:

mgr["B"] = DetData(mgr.ndet, (mgr.nsample, 3, 4), np.int32)
mgr["B"][1, 2:3, 0:2, 0:2] = 5
print("mgr['B'] = \n", mgr["B"])

# This creates a buffer with the full number of detectors and samples and uses the
# last dimensions of the RHS to determine the shape of the data per sample.  However,
# we have no information about what LHS slice we are assigning the RHS data to.  UNLESS
# the user gives a RHS data with the full n_detector x n_sample data set:

# mgr["C"] is created but not assigned, since we don't know where to assign the data
# along the first 2 axes (detector and sample).
mgr["C"] = np.ones((1, 1, 3, 4), dtype=np.int32)
mgr["C"][1, 2:3, 0:2, 0:2] = 5
print("mgr['C'] = \n", mgr["C"])

# mgr["D"] is created AND assigned, since we specify data of the full size.
mgr["D"] = np.ones((mgr.ndet, mgr.nsample, 3, 4), dtype=np.int32)
mgr["D"][1, 2:3, 0:2, 0:2] = 5
print("mgr['D'] = \n", mgr["D"])

I think that the mgr["C"] case is actually very confusing, since we are using the right hand side value just to get dimensions but not actually placing those values into the new array (since we don't have information about the location of that data in the full array).

How about we support cases A, B, and D:

  • Explicit create method
  • Assignment of a pre-created DetData, with checks for compatible number of detectors and samples
  • Assignment of a numpy array with the full-size data.

Does that seem acceptable?

@zonca
Member

zonca commented Oct 19, 2020

here:

# mgr["C"] is created but not assigned, since we don't know where to assign the data
# along the first 2 axes (detector and sample).
mgr["C"] = np.ones((1, 1, 3, 4), dtype=np.int32)
mgr["C"][1, 2:3, 0:2, 0:2] = 5
print("mgr['C'] = \n", mgr["C"])

This case is not supported, the user needs to initialize the array in 2 ways:

  • provide all samples/ all detectors (see case D)
  • provide all samples/1 detector (will be replicated to all detectors)

so the use case is:

# provide just 1 timeline, it will be copied to all detectors, we should support both 3D and 4D
mgr["C"] = np.ones((mgr.n_samples, 3, 4), dtype=np.int32)
# or
mgr["C"] = np.ones((1, mgr.n_samples, 3, 4), dtype=np.int32)
mgr["C"][1, 2:3, 0:2, 0:2] = 5
print("mgr['C'] = \n", mgr["C"])

inside mgr there should be an assert that checks that the right axis has a length of n_samples.
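The two supported initialization forms, plus the axis-length check, could be sketched like this. This is a toy manager written for illustration, not the actual toast implementation:

```python
import numpy as np

class DetDataMgr:
    """Toy sketch of the proposed __setitem__ behavior: accept full-size
    (ndet, nsample, ...) data, or a single timeline replicated to all
    detectors (3D or 4D form)."""

    def __init__(self, ndet, nsample):
        self.ndet = ndet
        self.nsample = nsample
        self.d = {}

    def __setitem__(self, key, value):
        value = np.asarray(value)
        if value.ndim >= 2 and value.shape[:2] == (self.ndet, self.nsample):
            # All detectors / all samples: assign directly (case D).
            self.d[key] = value.copy()
            return
        if value.ndim >= 2 and value.shape[0] == 1 and value.shape[1] == self.nsample:
            value = value[0]  # drop the unit detector axis (4D form)
        if value.shape[0] != self.nsample:
            raise ValueError("expected an axis of length n_sample")
        # One timeline: replicate across all detectors.
        buf = np.empty((self.ndet,) + value.shape, dtype=value.dtype)
        buf[:] = value
        self.d[key] = buf

    def __getitem__(self, key):
        return self.d[key]

mgr = DetDataMgr(2, 5)
mgr["C"] = np.ones((5, 3, 4), dtype=np.int32)     # 3D form, replicated
mgr["D"] = np.ones((1, 5, 3, 4), dtype=np.int32)  # 4D form, replicated
print(mgr["C"].shape, mgr["D"].shape)  # (2, 5, 3, 4) (2, 5, 3, 4)
```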

@tskisner
Member Author

Ok, no problem that sounds good. Will work on implementing and addressing your other feedback.


tskisner and others added 6 commits August 16, 2023 18:34

* Extract HDF5 observation metadata loading into a separate function

* Run format_source
* Fix flagging of noise model fits

* Some additional comments

* Add comment for error helper function
* Add support for arbitrary field separators

* Fix cooler cycle target

* Fix silly bugs

* Add unit test; fix schedule parsing

* Better unit test
tskisner and others added 30 commits October 14, 2025 18:47
* Allow user to disable log redirection

* Change disabling of redirect to use a boolean flag

* Make no-redirection the default
* Highpass-filter the signal before demodulation

* Experiment with bandpass filtering

* Use passband rather than highpass filter

* Flag pathological detectors

* Better help strings

* Add operator for rms/skew/kurtosis-based detector cuts

* Remove debug statements

* Remove debug statement, make error message more informative

* When deglitching demodulated data, apply flags to all coupled streams

* Fix error in flag mask application

* Fix two corner cases in time constant deconvolution

* Add a helpful warning message

* Add accessor for the detector properties list

* Calibrate now looks for gains in the focalplane database when a dictionary is not found in the Observation. It is an error if no gains are found

* Demodulation allows adjusting the passband

* better trait names

* Remove stale code
- Fix bug in parsing internal HDF5 paths of instrument files

- Implement writing of ground schedules to enable splitting
  and writing out schedule files from python.

- Rework GroundSchedule I/O to implement ECSV format.

- Fix segfault when inverting pixel covariance when some
  processes have no local pixels.

- Add utility script to plot maps and compute pseudo cls from
  job outputs.

- In coadd script, add support for accumulating hits map and
  also support per-map weights in the input text file.
* Testing wheels for 3.0.0a40 tag

* Restore tests and test doc build

* Bump python version and mkdocs-jupyter for docs build

* Remove stale docs reference

* Restore testing
The recent refactor of the TimeConstant operator accidentally swapped
the meaning of the deconvolve trait.  This fixes that and also adds
a unit test to verify that the convolution introduces an expected
phase shift.
* Improvements to healpix plotting

* Change default colormap to bwr
Simple detrend operator, removing mean/median/slope, with unit test
script included.

Signed-off-by: Bai-Chiang <[email protected]>
* Clarify per-map noise weights in coaddition

* When running tests force spt3g to 1.0.0, which was the last to support macos x86_64

* Apply extra inverse noise weights after we are certain we are working with noise weighted maps
* Add a helper script which creates a fake telescope HDF5 file

The SimGround and SimSatellite operators were recently updated
to accept a "telescope" file which contains all the instrument
and site information to conduct simulations.  This work adds
a helper script to construct generic telescope files for purely
synthetic experiments.

* Fix unit tests for MPI case
* Initialize `bunit`

* Fix error in deriving scaling from units

* Add a half-a-pixel offset in RA direction

* Remove erroneous offset correction

* Add a special so3g_compat_mode to SimGround which forces the pointing weather to match so3g defaults
* Fix some typos in the HWP filter

* Functional and unit-tested implementation

* Update src/toast/ops/t2pfilter.py

copilot caught a typo!

Co-authored-by: Copilot <[email protected]>

* Update src/toast/ops/t2pfilter.py

copilot caught an error

Co-authored-by: Copilot <[email protected]>

* Update src/toast/ops/t2pfilter.py

Another point for copilot

Co-authored-by: Copilot <[email protected]>

* Update src/toast/ops/t2pfilter.py

Co-authored-by: Copilot <[email protected]>

* Update src/toast/tests/ops_t2pfilter.py

Co-authored-by: Copilot <[email protected]>

* Update src/toast/tests/ops_t2pfilter.py

Co-authored-by: Copilot <[email protected]>

* Update src/toast/tests/ops_t2pfilter.py

Co-authored-by: Copilot <[email protected]>

* Update src/toast/tests/ops_t2pfilter.py

Co-authored-by: Copilot <[email protected]>

* Update src/toast/tests/ops_t2pfilter.py

Co-authored-by: Copilot <[email protected]>

* Format source, remove unused trait

---------

Co-authored-by: Copilot <[email protected]>
can filter different detector patterns: polyfilter, groundfilter
* Large refactor of HDF5 support

- Remove compile-time dependency on libFLAC and use flacarray instead

- Convert the existing version-1 HDF5 data format to use flacarray for
  low-level compression / decompression, while remaining backwards
  compatible with the existing on-disk group and dataset structure.

- Define a new version (2) of the native HDF5 data format which makes
  use of flacarray and enforces consistent use of per-field compression
  parameters.  This new version adds support for saving / loading
  arbitrary observation attributes, provided they define the save_hdf5()
  and load_hdf5() methods.

- Fix Observation duplicate() and __eq__() methods to correctly handle
  arbitrary attributes which are more complicated class instances.

* Fix import removed by cleanup

* - Add support for save / load of metadata consisting of nested
  python containers with scalars and arrays.

- Add Observation helper function for testing metadata equality

* - Overhaul instrument classes to provide consistent save_hdf5 and
  load_hdf5 methods.

- Update the observation save / load functions to use these instrument
  methods.

* Small fixes

* Address review comments
* Add support for precomputed templates in filterbin

* Move the kernel width optimization inside the lowpass and highpass filters

* Another cleanup of the lowpass API

* demodulate metadata

* Add time evolution to binned ground template

* Allow computing the observation matrix for subset of detectors

* Move template covariance calculation and regression into the SparseTemplates class; add names to all templates

* Use a more descriptive name for the compiled kernels

* allow outputting the template coefficients from FilterBin

* Cannot skip detectors just for observation matrix because the inverse covariance is affected.

* Fix symbol crash

* Resolve serial test failure
* Wheel tests for a41 tag

* Switch to macos-15 and macos-15-intel runners.  Update cibuildwheel version

* Only install spt3g as a testing dependency on linux

* Skip running wheel tests on macos-15-intel.  There is a segfault in a unit test that is not reproducible running in valgrind on linux.  It also does not show up on macos arm64.  Since the Intel architecture is deprecated for macos, it is not worth digging deeper.

* Restore tests
* Fix issues loading version-1 HDF5 files

- Handle case where version format string was missing

- Add standalone script for generating / comparing HDF5 files.  This
  was used to generate v1 files with an older toast code version and
  verify loading with latest code version.

- Add an MPI barrier when writing HDF5 files to a temp location and
  moving them into place.

* Fix indexing typo

* Address review comments
* Upgrade to new API

* fix typo
* Fix calls to deprecated plotly API

* Update copyright
* Add support for indexing an HDF5 volume

- Add a new VolumeIndex class, which interfaces to an sqlite DB that
  can be located anywhere (e.g. a different filesystem with better
  sqlite performance).  Support indexing arbitrary Observation metadata
  and nested attributes.  Support selection of observations with
  standard SQL across an arbitrary MPI communicator.

- Add util function for connections to an sqlite DB with options tuned
  for parallel / networked filesystems.

- In SaveHDF5, support creating a hierarchy of sub-directories in the volume,
  arranged by the first 5 digits of UNIX time and / or session name.

- In LoadHDF5, add support for arbitrary SQL select commands when choosing
  observations to load.

- Expand unit tests to include various uses of the index.

- Remove un-needed imports.

* For GroundSite observations, add mean Az/El and RA/DEC to the index

* Fix typo in serial case

* Remove deprecated SaveHDF5 option from a unit test

* To minimize chances of conflict with sqlite creation between processes, create the DB in a temp location and move into place.
I don't fully understand why there is this code block, but based on failures of the code, surely the pi/2 value should be only for "tiny" angles?
* Small changes for dealing with numpy arrays of unicode strings

Numpy-2 introduced arrays of variable-length unicode strings.  These
are incompatible with h5py datasets.  This PR:

- Adds helper functions for replacing numpy unicode arrays with fixed
  width byte strings.

- Process all observation metadata through these conversion functions
  when saving to HDF5.

* Simplify HDF5 file naming.  Remove the (often redundant) 'obs_' prefix
and also remove the integer UID suffix, since observation names are
already required to be unique.

* Ensure that observations always have a unique name

* Add test case for name-less observation
* Elevation modulation sine starting at a fixed phase every throw

---------

Co-authored-by: Reijo Keskitalo <[email protected]>
Older HDF5 volumes will not have an index.  If the user does not
explicitly disable use of the index, simply print a warning and
proceed to scan the filesystem as if the index was disabled.

Also fix a corner case where metadata objects containing the keyword
"type" would break the technique we use to instantiate classes
based on the contents of the HDF5 attributes before calling the
class loading function.