hgu2beta7REFSEQ {hgu2beta7} | R Documentation |
The Reference Sequence (RefSeq) database contains a curated, non-redundant set of sequences for genomic DNA, RNA, and protein for various organisms. hgu2beta7REFSEQ maps each probe identifier to all the RefSeq identifiers that NCBI has mapped to the gene represented by that probe identifier.
RefSeq ids differ in format according to the type of record they identify, as shown below:
NG_XXXXX: RefSeq accessions for genomic region (nucleotide) records
NM_XXXXX: RefSeq accessions for mRNA records
NC_XXXXX: RefSeq accessions for chromosome records
NP_XXXXX: RefSeq accessions for protein records
XR_XXXXX: RefSeq accessions for model RNAs that are not associated with protein products
XM_XXXXX: RefSeq accessions for model mRNA records
XP_XXXXX: RefSeq accessions for model protein records
Where XXXXX is a sequence of digits.
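The prefix conventions listed above can be sketched as a small lookup. This is only an illustration: `refseq_type` is a hypothetical helper, not part of the hgu2beta7 package, and the accessions in `example_ids` are made up to show the formats.

```r
# Hypothetical helper (not provided by hgu2beta7): label RefSeq ids
# by the record type that their two-letter prefix indicates.
refseq_type <- function(ids) {
    types <- c(
        NG = "genomic region",
        NM = "mRNA",
        NC = "chromosome",
        NP = "protein",
        XR = "model RNA",
        XM = "model mRNA",
        XP = "model protein"
    )
    # Keep only the part before the underscore, e.g. "NM" from "NM_000001"
    prefix <- sub("_.*$", "", ids)
    # Prefixes not listed in the table above come back as NA
    unname(types[prefix])
}

# Invented ids, used only to demonstrate the formats
example_ids <- c("NM_000001", "NP_000001", "XM_000001", "AB_000001")
refseq_type(example_ids)
```

Ids whose prefix is not one of the seven listed above are reported as NA rather than guessed at.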
NCBI (http://www.ncbi.nlm.nih.gov/RefSeq/) allows users to query the RefSeq database using RefSeq ids.
Mappings were based on data provided by:
LocusLink: ftp://ftp.ncbi.nih.gov/refseq/LocusLink/LL_tmpl.gz. Built: February 16, 2005
Package built Wed Feb 16 23:32:20 2005
NCBI http://www.ncbi.nlm.nih.gov and RefSeq http://www.ncbi.nlm.nih.gov/RefSeq/
# Load the annotation package that provides hgu2beta7REFSEQ
library(hgu2beta7)
# Convert the environment to a list
xx <- as.list(hgu2beta7REFSEQ)
# Remove probe ids that do not map to any RefSeq
xx <- xx[!is.na(xx)]
if(length(xx) > 0){
    # The RefSeq ids for the first two elements of xx
    xx[1:2]
    # Get the first one
    xx[[1]]
}
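A common follow-up is to keep only one type of RefSeq id per probe, for example the mRNA accessions. The sketch below assumes that each element of the list is a character vector of RefSeq ids, or a single NA for an unmapped probe. To stay runnable without the package installed, it uses a small stand-in list whose probe names and accessions are invented; with the package loaded, `xx` would instead come from `as.list(hgu2beta7REFSEQ)`.

```r
# Stand-in for as.list(hgu2beta7REFSEQ); probe names and accessions
# are invented for illustration only.
xx <- list(
    probe_a = c("NM_000001", "NP_000001"),
    probe_b = NA,
    probe_c = c("XM_000002", "NM_000003", "XP_000002")
)

# Remove probe ids that do not map to any RefSeq
xx <- xx[!is.na(xx)]

# Keep only the mRNA accessions (NM_ prefix) for each remaining probe
mrna <- lapply(xx, function(ids) ids[grepl("^NM_", ids)])

# Drop probes that have no mRNA accession left after filtering
mrna <- mrna[lengths(mrna) > 0]
mrna
```

Changing the pattern to another prefix from the list of id formats (for example `"^NP_"`) selects a different record type in the same way.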