Biological macromolecular structure databases in the artificial intelligence era: Coevolution, transformation, and future architectures

Ruiyun YANG, Jianhua HUANG, Qiangfeng ZHANG

Journal of Tsinghua University (Science and Technology), 2025, Vol. 65, Issue 12: 2449-2463.

DOI: 10.16511/j.cnki.qhdxxb.2025.21.056


Abstract

Significance: Determining the three-dimensional structures of biological macromolecules is fundamental to understanding the molecular basis of life and to discovering novel therapeutics. As the biological sciences enter the era of artificial intelligence (AI), structural data have become increasingly essential, while AI technologies simultaneously place higher demands on data organization and management. This review traces the five-decade evolution of biological macromolecular structure databases, with a particular focus on the pivotal role of the Protein Data Bank (PDB). The PDB was established as a small archive of experimentally determined atomic coordinates and has gradually developed into a global infrastructure that underpins structural biology.
Progress: We first chart the progression of structural data resources from early structure archives, which largely functioned as static catalogs of experimentally determined structures, to highly curated structural classification systems such as SCOP and CATH. These resources enable researchers to analyze structural relationships, investigate evolutionary patterns, and derive mechanistic insights. In parallel, sequence-centric databases, such as Pfam, InterPro, and later comprehensive domain-family resources, expanded by annotating conserved elements across protein sequence space. Together, these efforts created a rich, multi-layered ecosystem in which the sequence, structure, and function of proteins became increasingly integrated, turning structure databases into indispensable platforms for comparative analysis and mechanistic discovery. A new phase of structural data expansion began with AI-driven structure prediction. The release of the AlphaFold Protein Structure Database (AFDB), followed by complementary resources including the ESM Atlas, brought an unprecedented expansion in structural coverage, spanning entire proteomes and previously challenging protein families.
Conclusions and Prospects: We propose that structural databases and AI models form a mutually reinforcing "double-helix" data model. High-quality experimental structures provide essential references for training and benchmarking predictive models, while large-scale AI-generated structures dramatically increase the amount of available data, revealing new sequence-structure-function relationships and enriching the databases themselves. This synergy could catalyze a paradigm shift in structural biology, moving the field from an experiment-led discipline to an integrated ecosystem in which computation and experimentation coevolve. Despite rapid progress in the field, major challenges persist. Structural databases remain affected by experimental sampling biases, uneven representation across organisms and protein families, and persistent inconsistencies in annotation quality. Moreover, the scarcity of dynamic and condition-dependent structural information limits biological interpretability, particularly for intrinsically disordered regions, conformational ensembles, and transient complexes. In addition, AI-driven predictions raise new concerns regarding model interpretability, calibration of confidence metrics, and the governance of large-scale predictive datasets. We anticipate that biological macromolecular structure databases will evolve from merely "AI-enhanced" to "AI-integrated" and, ultimately, to "AI-native" architectures. Such systems will incorporate continuous feedback mechanisms, automated annotation pipelines, and multi-modal data fusion, enabling them to function as reliable knowledge instruments capable of hosting biologically meaningful "digital twins." Collectively, these developments promise to enhance our understanding of structure-function relationships and accelerate rational design in protein engineering, drug discovery, and synthetic biology. As a result, structural databases will continue to underpin scientific innovation while defining a new research standard for the biological sciences.

Key words

biological macromolecular structure databases / artificial intelligence (AI) / protein structure prediction / database ecosystem / AI-native

Cite this article

Ruiyun YANG, Jianhua HUANG, Qiangfeng ZHANG. Biological macromolecular structure databases in the artificial intelligence era: Coevolution, transformation, and future architectures[J]. Journal of Tsinghua University (Science and Technology), 2025, 65(12): 2449-2463. https://doi.org/10.16511/j.cnki.qhdxxb.2025.21.056


The authors thank Professor Hongwei Wang of Tsinghua University for constructive comments on the manuscript.

RIGHTS & PERMISSIONS

All rights reserved. Unauthorized reproduction is prohibited.