| matchprobes {matchprobes} | R Documentation |
The query sequence, a character string (probably representing
a transcript of interest), is scanned for the presence of exact
matches to the sequences in the character vector records.
The indices of the set of matches are returned.
matchprobes(query, records, probepos=FALSE)
query |
A character vector. For example, each element may represent a gene (transcript) of interest. |
records |
A character vector. For example, each element may represent the probes on a DNA array. |
probepos |
A logical value. If TRUE, return also the start positions of the matches in the query sequence. |
The matching is done using the C library function strstr. It
might be nice to explore other possibilities.
A list.
Its first element is a list of the same length as the input vector.
Each element of the list is a numeric vector containing the indices of
the probes that have a perfect match in the query
sequence.
If probepos is TRUE,
the returned list has a second element: it is of the same shape
as described above, and gives the respective positions of the
matches.
R. Gentleman, Laurent Gautier, Wolfgang Huber
## The main intention for this function is together with the probe
## tables from the "probe" data packages, e.g.:
## > library(hgu95av2probe)
## > data(probe)
## > seq <- probe$sequence
##
## Since we do not want to be dependent on the presence of this
## data package, for the sake of example we simply simulate some
## probe sequences:
bases <- c("A", "C", "G", "T")
seq <- sapply(1:1000, function(x) paste(bases[ceiling(4*runif(256))], collapse=""))
w1 <- seq[20:22]
w2 <- complementSeq(w1, start=13, stop=13)
w <- c(w1, w2)
matchprobes(w, seq)
matchprobes(w, seq, probepos=TRUE)