コーパス検索結果 (left1)
通し番号をクリックするとPubMedの該当ページを表示します
1 Pfam clans are described in detail, together with the ne
2 Pfam contains multiple alignments and hidden Markov mode
3 Pfam homology and domain boundary annotations in the tar
4 Pfam is a collection of multiple alignments and profile
5 Pfam is a comprehensive collection of protein domains an
6 Pfam is a database of protein families that currently co
7 Pfam is a large collection of protein domains and famili
8 Pfam is a large collection of protein families and domai
9 Pfam is a large collection of protein multiple sequence
10 Pfam is a large collection of protein multiple sequence
11 Pfam is a widely used database of protein families and d
12 Pfam is a widely used database of protein families, curr
13 Pfam is available on the web in the UK, the USA, France
14 Pfam is available on the web in the UK, the USA, France
15 Pfam is available on the World Wide Web in the UK at htt
16 Pfam is available on the WWW in the UK at http://www.san
17 Pfam is available via servers in the UK, the USA and Swe
18 Pfam is available via servers in the UK, the USA and Swe
19 Pfam is now based not only on the UniProtKB sequence dat
20 Pfam protein domains are often thought of as evolutionar
21 Pfam release 24.0 contains 11,912 families, of which a l
22 Pfam search results have been calculated for the entire
23 Pfam term enrichment analysis revealed 172 protein famil
24 Pfam, available via servers in the UK and the USA, is a
25 Pfam-B, the automatically-generated supplement to Pfam,
26 systematic large-scale study of nearly 2,000 Pfam protein families with sufficient sequence informati
30 39), DUF399 (domain of unknown function 399; Pfam ID: PF04187) and MARTX toxins that contribute to ho
31 unique domain-domain interactions among 4036 Pfam domains, out of which 4349 are inferred from PDB en
32 as diseasesusceptible and 32 proteins and 67 Pfam families (10,783 domains) as diseaseresistant based
35 F0-ATPase-regulatory proteins representing a Pfam protein family of 246 sequences from 219 species (P
36 ts: A neutral nucleotide model compared to a Pfam domain encoding model (PSILC(nuc/dom)); A protein c
40 he protein families data base of alignments (Pfam) analysis suggested the wit3.0 peptide sequence sha
41 e three-dimensional structure models for all Pfam-A sequence families with average length under 150 r
43 transcription factor-family curation of all Pfam domains, incorporated the Gene Ontology classificat
44 modelling of discontinuous domains allowing Pfam domain definitions to be closer to those found in s
45 heir UniProt BLASTX hits, GO annotation, and Pfam analysis results, are freely accessible as a public
46 to other databases, such as ArrayExpress and Pfam, entries from which are also cited widely in the li
47 B database are, however, still available and Pfam annotations for individual UniProtKB sequences can
48 sequence-only (Superfamily, PDBAA BLAST and Pfam) and sequence-structure-based (SAM-T02, 3D-PSSM, mG
49 yme (carbohydrate active enzyme) classes and Pfam clans, which attested its usefulness in the phyloge
51 ies directly using RPS-BLAST against COG and Pfam databases and indirectly via proxygenes that are id
53 esentatives of the exhaustive databases, and Pfam-A and Superfamily as databases that predefine famil
54 domain classifications such as InterPro and Pfam, and other ontologies such as mammalian phenotype a
55 n information from Superfamily, InterPro and Pfam; three-dimensional structures at the Protein Data B
59 me structures as described by CATH, SCOP and Pfam, and is available as an interactive website or a fl
60 us sequence databases, domains from SCOP and Pfam, patterns from Prosite and other predicted sequence
64 ion of the sequence family databases such as Pfam and Interpro with the structure-oriented databases
65 rences to other biological resources such as Pfam, SCOP, CATH, GO, InterPro and the NCBI taxonomy dat
69 ysis to other databases: Reactome, BioCarta, Pfam, PID and SMART, finding additional hits in ErbB and
71 ve procedure to that of the HMMs provided by Pfam and SUPERFAMILY, curated ensembles of multiple alig
73 m, available at http://ipfam.org, catalogues Pfam domain interactions based on known 3D structures th
74 r with associated data including SCOP, CATH, Pfam, SWISSPROT, InterPro, GO terms, Protein Quaternary
75 h alignment data from the public collections Pfam and SMART, as well as with contributions from colle
76 These results show that genes containing Pfam domains associated with duplication resistance in A
77 rches against a reference library containing Pfam-annotated UniProt sequences and random synthetic se
78 ies used in this study and the corresponding Pfam families is available at http://www.sanger.ac.uk/Us
79 cid sequences that matches the corresponding Pfam family seed alignment, an alignment of DNA sequence
81 he structure's key reference, citation data, Pfam domain diagrams, topology diagrams and protein-prot
82 Each report also contains expression data, Pfam domain information and an associated Mouse Mutant P
84 e method uses the Protein Families database (Pfam) and motif finding algorithms to identify oligonucl
85 f HMMs is taken from two existing databases (Pfam and SUPERFAMILY), and is limited to models that exc
91 ble protein family databases (Blocks + DOMO, Pfam, PIR-ALN, PRINTS, PROSITE, ProDom, PROTOMAP, SBASE,
92 vel highly conserved protein domain, DUF162 [Pfam: PF02589], can be mapped to two proteins: LutB and
95 am/ and in the US at http://genome.wustl.edu/Pfam/ Pfam 2.0 matches one or more domains in 50% of Swi
97 proteins, but also in erythromycin esterase (Pfam ID: PF05139), DUF399 (domain of unknown function 39
98 open reading frame, kinase) protein family (Pfam 00480) is a large collection of bacterial polypepti
99 embers of an uncharacterized protein family (Pfam PF08000), which provide compelling evidence for the
101 (max 6.6%) for species, 9.0% (max 28.7%) for Pfam protein domains and 9.4% (max 22.9%) for PANTHER ge
102 stimate that more than 4000 contact maps for Pfam families of unknown structure have more than 50% of
103 n the str and stl families is available from Pfam; alignments of all translations are available at ht
104 m Prints not present in PROSITE, blocks from Pfam-A not present in PROSITE or Prints, and so on for P
105 UniProt sequence database, domain data from Pfam, metabolic pathway and functional data from COGs, K
106 CDD collection contains models imported from Pfam, SMART and COG, as well as domain models curated at
107 and DIP; functional domain information from Pfam; protein fingerprints from PRINTS; protein family a
109 gent query domains, originally selected from Pfam, and full-length proteins containing their homologo
110 ered (overall, 82.9% of sequences taken from Pfam) and the alignment of amino acid sequences restrict
111 protein family (domain of unknown function; Pfam families PF07005 and PF17042) and (ii) discovered n
112 ce polymorphisms) and family resources (e.g. Pfam and eggNog) and displayed on the Gene3D website.
114 f ROK glucokinases and non-ROK glucokinases (Pfam 02685), revealing the primary sequence elements sha
115 ltritol to d-tagatose via a dehydrogenase in Pfam family PF00107, a previously unknown reaction; 2) p
117 r the past 2 years the number of families in Pfam has doubled and now stands at 6190 (version 10.0).
118 ions: (i) for all protein domain families in Pfam, the fixation of genes horizontally transferred is
119 integral membrane proteins, cereal genes in Pfam family PF02458 emerged as candidates for the ferulo
120 se to d-tagatose 6-phosphate via a kinase in Pfam family PF00294, a previously orphan EC number; and
121 uently scored more remotely related Pfams in Pfam clans higher than closely related Pfams, thus, lead
123 e have expanded this coverage by integrating Pfam and SUPERFAMILY domain annotations, and we now reso
125 arge protein families (including the largest Pfam alignment containing 27000 HIV GP120 glycoprotein s
126 ilies to the Wikipedia community, by linking Pfam families to relevant Wikipedia pages and encouragin
129 using protein regions that match two or more Pfam families not currently annotated as related in Pfam
131 shold, (iii) recognition of FDRs in Multiple Pfam enzyme families, and (iv) recognition of multiple P
135 antial phylogenetic separations (1.1-9.7% of Pfam families surveyed at three taxonomic ranges studied
140 lent domains, and conversely the majority of Pfam domains sampled by our data play no currently estab
141 -4 to d-fructose 6-phosphate via a member of Pfam family PF08013, another previously unknown reaction
144 parsimony approach to compare repertoires of Pfam domains and their combinations, we show that indepe
145 In contrast, in a non-redundant sample of Pfam-AB, only 1% of four-amino acid word clumps (4.7% of
146 range of Pfam web tools and the first set of Pfam web services that allow programmatic access to the
147 represents the first eukaryotic structure of Pfam family PF03937 and reveals a conserved surface regi
150 ade using a new algorithm based primarily on Pfam domain occurrence patterns in mitochondrial and non
151 ene Ontology Annotation projects; updates on Pfam, SMART and InterPro domain databases; update papers
152 arch options allow search by UniProt code or Pfam domain identifier, and results can be filtered by d
154 , diverse source databases: Gene3D, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, SUPERFAMILY
155 d in the US at http://genome.wustl.edu/Pfam/ Pfam 2.0 matches one or more domains in 50% of Swissprot
156 nd Genomes (KEGG) annotations, and potential Pfam domains were assigned to each transcript isoform.
161 ignature databases: PROSITE, PRINTS, ProDom, Pfam, SMART, TIGRFAMs, PIRSF, SUPERFAMILY, Gene3D and PA
162 oteins was predicted with pattern profilers (Pfam, Prosite, TMHMM, and pSORT), and by examining queri
163 ent; and integration with PROSITE, profiles, Pfam and ProDom, as part of the international InterPro p
166 ported as being homologues of TraB proteins (Pfam ID: PF01963), a widely distributed family of unknow
168 o expert annotations of domain-like regions (Pfam-A) and completing through cuts based on termini of
171 ures are the first examples of the Rep_trans Pfam family providing insights into the replication of n
172 tion features important updates on the EBI's Pfam, PDBe and PRIDE databases, as well as a variety of
176 are/Pfam/, in Sweden at http://www.cgb.ki.se/Pfam/, in France at http://pfam.jouy.inra.fr/ and in the
178 e frequently found to consist of known short Pfam domains, e.g., leucine-rich repeats, tetratricopept
180 for the presence of protease cleavage sites, Pfam domains, glycosylation sites, signal peptides, tran
181 domain alignment collections, such as SMART, Pfam and COG, we have continued an effort to update, and
183 n Europe at http://www.sanger.ac.uk/Software/Pfam/ and http://www.cgr.ki.se/Pfam/, and in the US at h
184 n the UK at http://www.sanger.ac.uk/Software/Pfam/, in Sweden at http://www.cgb.ki.se/Pfam/, in Franc
185 n the UK at http://www.sanger.ac.uk/Software/Pfam/, in Sweden at http://www.cgr.ki.se/Pfam/ and in th
186 browsing at http://www.sanger.ac.uk/Software/Pfam/iPfam/; the source-data for iPfam is freely availab
188 istory of each seed sequence in the spurious Pfam protein family (PF10695, 'Cw-hydrolase') uncovered
189 ences, but for proteins with known structure Pfam matches 95%, which we believe represents the likely
191 available, have been utilised to ensure that Pfam families correspond with structural domains, and to
192 he most significant of these changes is that Pfam is now primarily based on the UniProtKB reference p
193 nce databases the fraction of sequences that Pfam matches is reduced, suggesting that continued addit
199 ting Intolerant from Tolerant (SIFT) and the Pfam-based LogR.E-value method, we have identified featu
202 method precludes definitive conclusions, the Pfam models provide the only tertiary structure informat
203 s of a nonredundant amino acid database, the Pfam domain database, plant Expressed Sequence Tags, and
204 relevant Wikipedia pages and encouraging the Pfam and Wikipedia communities to improve and expand tho
206 iple sequence alignment as obtained from the Pfam database (a database of protein families and conser
212 , using 1750 gene trees constructed from the Pfam protein family database, that it appears to be a pr
214 in the database by adding families from the Pfam-A, ProDom and Domo databases to those from PROSITE
216 the domain of unknown function DUF28 in the Pfam and PALI databases for which there was no structura
217 enome that contain a domain described in the Pfam database as domain of unknown function 579 (DUF579)
218 known relationships between families in the Pfam database as well as detect novel distant relationsh
220 manually curated inclusion thresholds in the Pfam database, especially on the subset of families that
224 ition-specific scoring functions used in the Pfam models, the score statistics of hybrid alignment ob
225 6 models of protein domains contained in the Pfam v5.4 database verifies the theoretical predictions:
226 nces that contain the coding sequence of the Pfam alignment when they can be recovered (overall, 82.9
228 nces derived from the seed alignments of the Pfam database of amino acid alignments of families of ho
230 s intended for genome-scale searching of the Pfam database without having to install this database an
233 The new method, MITOPRED, is based on the Pfam domain occurrence patterns and the amino acid compo
235 Methodology improvements for searching the Pfam collection locally as well as via the web are descr
237 ion was compared to a proteome data set, the Pfam domain database, and the genomes of six other fungi
243 t of bioactivities stored in ChEMBL with the Pfam-A domain most likely to mediate small molecule bind
244 n interaction data was integrated within the Pfam database and website, but it has now been migrated
250 oups) and KO (KEGG Orthology) in addition to Pfam domains; (iii) information on intronless genes are
255 Small molecule binding has been mapped to Pfam-A domains of protein targets in the ChEMBL bioactiv
256 lust sequences are annotated with matches to Pfam, SCOP domains, and proteins in the PDB, using our H
258 ide Web in the UK at http://www.sanger.ac.uk/Pfam/ and in the US at http://genome.wustl.edu/Pfam/ Pfa
260 than a thousand structurally uncharacterized Pfam families to achieve reasonable structural annotatio
262 updated version to 26,219 among 5140 unique Pfam domains, a 23% increase compared to 20,513 unique D
263 kground information, domain architecture via Pfam links, a list of all sequences and an assessment of
WebLSDに未収録の専門用語(用法)は "新規対訳" から投稿できます。