Skip to content

mokume Optimization Discussion #3

@yueqixuan

Description

@yueqixuan
  1. During feature normalization (features2peptides), it may be necessary to first apply a series of filters to peptides (features), and then compute peptide frequencies and normalization factors (e.g., global median), to ensure that the results are not affected by contaminants, decoys, or other artifacts. (corresponding code)

Suggestions from Tony:

  1. Keep only intensity > 1. Reasoning: You'll notice when seeing spectra visually that intensities 0 to 1 typically appear to be background noise
  2. Log2 transform intensity data (log transform intensity before any normalization / feature removal / summarization). Reasoning: Intensities are typically right-skew, which can lower the power of differential abundance analysis (also logs are easier to handle in statistics)
  3. [Merge fractions] MSstats: if a feature is measured across multiple fractions for a sample, MSstats takes the maximum intensity among them.  Assumption here is that a peptide ion should elute dominantly in one fraction (and signal in other fractions is likely noise)
  4. Batch effect correction may not be appropriate, especially when the definition of a batch is unclear here.  Also since each run is associated with a single biological condition, treating each run as a batch would remove the biological effect of interest.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions