Filtering of a data frame using a threshold on protein identification score and gene names

filter_Proteins(df, min_score = 0, filter_gene_name = TRUE,
  Column_ID = "Protein.IDs", Column_gene_name = "Gene.names",
  Column_score = "Score", split_param = ";")

Arguments

df

A data frame

min_score

Threshold for protein identification score

filter_gene_name

logical, filter out proteins withy empty gene name

Column_ID

Column with protein IDs

Column_gene_name

The name of df's column containing gene names

Column_score

The name of df's column containing protein identification score

split_param

Character used to split gene names into substrings.

Value

A filtered data frame. Contains an extra column with the first substring of the column Column_gene_name