Preprocessing of raw data

preprocess_data(df, Column_gene_name = "Gene.names",
  Column_score = "Score", Column_ID = "Protein.IDs",
  Column_Npep = NULL, Column_intensity_pattern = "^Intensity.",
  bait_gene_name, condition = NULL, bckg_bait = bait_gene_name,
  bckg_ctrl = "WT", log = TRUE, filter_time = NULL,
  filter_bio = NULL, filter_tech = NULL, min_score = 0,
  filter_gene_name = TRUE, ...)

Arguments

df	Data.frame with protein intensities
Column_gene_name	Column with gene names
Column_score	Column with protein identification score
Column_ID	Column with protein IDs
Column_Npep	Column with number of theoretically observable peptides per protein
Column_intensity_pattern	Pattern (regular exrpression) used to identfy df's columns containing protein intensity values
bait_gene_name	The gene name of the bait
condition	data.frame with columns "column", bckg", "bio", "time" and "tech" indicating for each intensity column ("sample") its corresponding background ("bckg"), biologicla replicate ("bio), experimental condition ("tine) and technical replicate ("tech).
bckg_bait	Name of the bait background as found in `condition$bckg` (see below)
bckg_ctrl	Name of the control background as found in `condition$bckg` (see below)
log	logical, use geometric mean to average technical replicates
filter_time	vector of experimental conditions to exclude from analysis
filter_bio	vector of biological replicates to exclude from analysis
filter_tech	vector of technical replicates to exclude from analysis
min_score	threshold on identification score
filter_gene_name	logical, filter out proteins withy empty gene name
...	Additional parameters passed to function `identify_conditions`

Preprocessing of raw data

Arguments

Contents