genevarfilter
Filter genes with small profile variance
Syntax
Mask
= genevarfilter(Data
)
[Mask
, FData
]
= genevarfilter(Data
)
[Mask
, FData
, FNames
]
= genevarfilter(Data
, Names
)
genevarfilter(..., 'Percentile', PercentileValue
,
...)
genevarfilter(..., 'AbsValue', AbsValueValue
,
...)
Arguments
Data | DataMatrix object or numeric matrix where each row corresponds to a gene. If a matrix, the first column is the names of the genes, and each additional column is the results from an experiment. |
Names | Cell array of character vectors or string vector where each element corresponds to the
name of a gene for each row of experimental data.
|
PercentileValue | Specifies a percentile below which gene expression profiles
are removed. Choices are integers from |
AbsValueValue | Property to specify an absolute value below which gene expression profiles are removed. |
Description
Gene profiling experiments typically include genes that exhibit little variation in their profile and are generally not of interest. These genes are commonly removed from the data.
calculates
the variance for each gene expression profile in Mask
= genevarfilter(Data
)Data
and
returns Mask
, which identifies the gene
expression profiles with a variance less than the 10th percentile. Mask
is
a logical vector with one element for each row in Data
.
The elements of Mask
corresponding to rows
with a variance greater than the threshold have a value of 1
,
and those with a variance less than the threshold are 0
.
[
calculates
the variance for each gene expression profile in Mask
, FData
]
= genevarfilter(Data
)Data
and
returns FData
, a filtered data matrix,
in which the low-variation gene expression profiles are removed. You
can also create FData
using
.FData
=
Data(Mask
,:)
[
returns Mask
, FData
, FNames
]
= genevarfilter(Data
, Names
)FNames
, a filtered names array, in which the names
associated with low-variation gene expression profiles are removed.
Names
is a cell array of character vectors or string
vector of the names of the genes corresponding to each row in
Data
. You can also create
FNames
using
.FNames
=
Names(Mask
)
Note
If Data
is a DataMatrix object with
specified row names, you do not need to provide the second input Names
to
return the third output FNames
.
genevarfilter(..., '
calls PropertyName
', PropertyValue
,
...)genevarfilter
with optional
properties that use property name/property value pairs. You can specify
one or more properties in any order. Each PropertyName
must
be enclosed in single quotation marks and is case insensitive. These
property name/property value pairs are as follows:
genevarfilter(..., 'Percentile',
removes from PercentileValue
,
...)Data
, the
experimental data, the gene expression profiles with a variance less
than the percentile specified by PercentileValue
.
Choices are integers from 0
to 100
.
Default is 10
.
genevarfilter(..., 'AbsValue',
removes from AbsValueValue
,
...)Data
, the
experimental data, the gene expression profiles with a variance less
than AbsValueValue
.
Examples
Load the MAT-file, provided with the Bioinformatics Toolbox™ software, that contains yeast data. This MAT-file includes three variables:
yeastvalues
, a matrix of gene expression data,genes
, a cell array of GenBank® accession numbers for labeling the rows inyeastvalues
, andtimes
, a vector of time values for labeling the columns inyeastvalues
load yeastdata
Filter genes with a small profile variance.
[fyeastvalues, fgenes] = genevarfilter(yeastvalues,genes);
References
[1] Kohane I.S., Kho A.T., Butte A.J. (2003), Microarrays for an Integrative Genomics, Cambridge, MA:MIT Press.
Version History
Introduced before R2006a