蛋白质数据库的介绍

2024-05-19 11:23

1. 蛋白质数据库的介绍

蛋白质数据库是指包括蛋白质信息的数据库。常用的蛋白质数据库有很多，其中Uniprot被认为收录最广泛和注释信息最全面的蛋白质数据库。Uniprot下包括Swiss-Prot、TrEMBL和PIR-PSD，详见Uniprot_百度百科。其他的蛋白数据库有PDB（Protein Data Bank，简称PDB，开始建立于1971年）等。国内也有些如由上海生物信息技术研究中心下属的生物信息科学数据共享平台建立及维护的SDSPB等。

蛋白质数据库的介绍

2. 蛋白质数据库的简介

蛋白质数据库（HPDB），建于2005年5月， 动态展示生物大分子立体结构，鼠标点击放大分子结构、原子定位、测定原子之间距离，可用于教学或科研。 服务对象是能够熟练使用中文的生命科学、医学、药学、农学、林学等领域的大中专学生、教师及科技工作者 。分子结构特征描述采用汉语，同时提供英文原文以供考证。 对于善于使用英文的读者，我们提倡直接访问RCSB PDB，一来可以减少网络拥挤，二来可以减少由于 HPDB 的翻译不妥带来的不便。蛋白质数据库（HPDB）对每个蛋白质分子结构说明部分做了中文翻译（最新加入数据库的分子除外），内容包括分子结构定性描述、样品的来源、表达载体、宿主、化学分析方法、分子结构组成成分等。 这些信息并同蛋白质分子结构数据存储于数据库， 因此 HPDB 支持中文查询。蛋白质数据库（HPDB）虽然翻译了“分子结构说明”部分，但为了保证数据的可靠性和准确性，HPDB对一级结构序列及大分子结构坐标数据等未做任何改动，数据库保持 RCSB PDB 核实后的原始实验数据文件，并保持 PDB 文件格式和蛋白质分子编号。

3. 蛋白质数据库的内容

Protein Sequence Databases:   Antibodies: Sequence and StructureBRENDA: Enzyme database　　CD Antigens: Database of CD antigens　　dbCFC: Cytokine family database　　Histons: Histone sequence database　　HPRD: Human protein reference database　　InterPro: Intergrated documentation 5resources for protein families　　iProClass: An integrated protein classification database　　KIND: A non-redundant protein sequence database　　MHCPEP: Database of MHC binding peptides　　MIPS: Munich information centre for protein sequences　　PIR: Annotated, and non-redundant protein sequence database　　PIR-ALN: Curated database of protein sequence alignments　　PIR-NREF: PIR nonredundent reference protein database　　PMD: Protein mutant database　　PRF: Protein research foundation, Japan　　ProClass: Non-redundant protein database　　ProtoMap: Hierarchical classification of swissprot proteins　　REBASE: Restriction enzyme database　　RefSeq: Reference sequence database at NCBI　　SwissProt: Curated protein sequence database　　SPTR: Comprehensive protein sequence database　　Transfac: Transcription factor database　　TrEMBL: Annotated translations of EMBL nucleotide sequences　　Tumor gene database: Genes with cancer-causing mutations　　WD repeats: WD-repeat family of proteins  　　Protein Structure Databases:  Cath: Protein structure classification　　HIV Protease: HIV protease database 3D structure　　PDB: 3-D macromolecular structure data　　PSI: Protein structure initiative　　S2F: Structure to function project　　Scop: Structural Classification of Proteins  Protein Domains, Motifs & Signatures:  BLOCKS: Multipe aligned segments of conserved protein regions　　CCD: Conserved domain database and search service　　DOMO: Homologous protein domain families　　Pfam: Database of protein domains and HMMs　　ProDom: Protein domain database　　Prints: Protein motif fingerprint database　　Prosite: Database of protein families and domains　　SMART: Simple modular architecture research tool　　TIGRFAM: Protein families based on HMMs  Others:  Phospho Site: Database of phosphorylation sites

蛋白质数据库的内容