mokume Optimization Discussion #3

1. During feature normalization (`features2peptides`), it may be necessary to filter the peptides (features) first and only then compute peptide frequencies and normalization factors (e.g., global median), so that the results are not affected by contaminants, decoys, or other artifacts. ([corresponding code](https://github.com/bigbio/mokume/blob/main/mokume/normalization/peptide.py#L670-L679))
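
To illustrate why the order matters, here is a minimal sketch using only the Python standard library. It is not mokume's implementation: the record fields, the `CONTAMINANT_`/`DECOY_` prefixes, and the helper names are all assumptions made for the example, and "global median" is taken here to mean the median of the per-sample medians, which is only one possible definition.

```python
from collections import Counter
from statistics import median

# Hypothetical feature records; field names are assumptions, not mokume's schema.
features = [
    {"peptide": "PEPTIDEA", "protein": "P12345", "sample": "S1", "intensity": 100.0},
    {"peptide": "PEPTIDEB", "protein": "P12345", "sample": "S1", "intensity": 300.0},
    {"peptide": "PEPTIDEC", "protein": "CONTAMINANT_P00761", "sample": "S1", "intensity": 90000.0},
    {"peptide": "PEPTIDEA", "protein": "P12345", "sample": "S2", "intensity": 200.0},
    {"peptide": "PEPTIDEB", "protein": "P12345", "sample": "S2", "intensity": 600.0},
    {"peptide": "PEPTIDED", "protein": "DECOY_P67890", "sample": "S2", "intensity": 5.0},
]

# Assumed labelling convention for entries that should not influence normalization.
EXCLUDED_PREFIXES = ("CONTAMINANT_", "DECOY_")


def is_valid(feature):
    """Return True for features that are neither contaminants nor decoys."""
    return not feature["protein"].startswith(EXCLUDED_PREFIXES)


def normalization_factors(rows):
    """Per-sample factor = (median of sample medians) / (sample median)."""
    by_sample = {}
    for row in rows:
        by_sample.setdefault(row["sample"], []).append(row["intensity"])
    medians = {sample: median(values) for sample, values in by_sample.items()}
    global_median = median(medians.values())
    return {sample: global_median / m for sample, m in medians.items()}


# Filter first, then derive frequencies and factors from the filtered set only.
filtered = [f for f in features if is_valid(f)]
peptide_frequencies = Counter(f["peptide"] for f in filtered)
factors_filtered = normalization_factors(filtered)

# For comparison: factors computed on the unfiltered data.
factors_unfiltered = normalization_factors(features)

print(factors_filtered)
print(factors_unfiltered)
```

In this toy data the contaminant and decoy entries shift both sample medians, so the two sets of factors differ even though the retained peptides are identical.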

**Suggestions from Tony:**

> 1. Keep only intensities > 1. Reasoning: when inspecting spectra visually, intensities between 0 and 1 typically appear to be background noise.
> 2. Log2-transform the intensity data before any normalization, feature removal, or summarization. Reasoning: intensities are typically right-skewed, which can lower the power of differential abundance analysis (logs are also easier to handle statistically).
> 3. [Merge fractions] MSstats: if a feature is measured across multiple fractions for a sample, MSstats takes the maximum intensity among them. The assumption is that a peptide ion should elute predominantly in one fraction (and signal in other fractions is likely noise).
> 4. Batch effect correction may not be appropriate, especially when the definition of a batch is unclear here. Also, since each run is associated with a single biological condition, treating each run as a batch would remove the biological effect of interest.
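
Suggestions 1 to 3 can be sketched as a single preprocessing pass. This is a hedged illustration rather than mokume or MSstats code: the record fields and the `preprocess` helper are hypothetical, and the example uses only the Python standard library.

```python
import math

# Hypothetical rows: one intensity per feature, sample, and fraction.
rows = [
    {"feature": "PEPTIDEA_2", "sample": "S1", "fraction": 1, "intensity": 1024.0},
    {"feature": "PEPTIDEA_2", "sample": "S1", "fraction": 2, "intensity": 16.0},
    {"feature": "PEPTIDEB_3", "sample": "S1", "fraction": 1, "intensity": 0.5},
    {"feature": "PEPTIDEB_3", "sample": "S1", "fraction": 2, "intensity": 256.0},
]


def preprocess(rows):
    """Drop intensities <= 1, log2-transform, keep the max across fractions."""
    merged = {}
    for row in rows:
        # Suggestion 1: treat intensities in [0, 1] as background noise.
        if row["intensity"] <= 1:
            continue
        # Suggestion 2: log2-transform before any further processing.
        log_intensity = math.log2(row["intensity"])
        # Suggestion 3: per (feature, sample), keep the maximum over fractions.
        key = (row["feature"], row["sample"])
        merged[key] = max(merged.get(key, log_intensity), log_intensity)
    return merged


result = preprocess(rows)
print(result)
```

Because log2 is monotonically increasing, taking the maximum before or after the transform selects the same fraction, so the order of steps 2 and 3 does not change the result in this sketch.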

