Overview

Epoch AI’s Biology AI Models Dataset is a collection of machine learning models trained on biological data, together with key information about their training. It is useful for research on trends in the application of artificial intelligence to biology. This documentation describes which models the dataset contains and how its records are structured, including data fields and definitions. The dataset is available on our website as a visualization and for download as a CSV file that is updated daily.

If you would like to ask any questions about the dataset, or suggest a model that should be added, feel free to contact us at data@epoch.ai. If this dataset is useful to you, please cite it. To request access to data about biological model safeguards, please contact safeguards@epoch.ai.

Use This Work

Epoch’s data is free to use, distribute, and reproduce provided the source and authors are credited under the Creative Commons Attribution license.

Citation

Epoch AI, 'Data on AI Models'. Published online at epoch.ai. Retrieved from 'https://epoch.ai/data/biology-ai-models-documentation' [online resource]. Accessed 12 May 2026.

BibTeX Citation

@misc{EpochAIModels2025,
  title = {{Data on AI Models}},
  author = {{Epoch AI}},
  year = {2025},
  month = {7},
  url = {https://epoch.ai/data/biology-ai-models-documentation},
  note = {Accessed: 12 May 2026}
}

Python Import

import pandas as pd

data_url = "https://epoch.ai/data/all_ai_models.csv"
models_df = pd.read_csv(data_url)
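Once the CSV is loaded, a common next step is to narrow the table to the records of interest. The following is a minimal sketch, not part of the official documentation: the helper `filter_by_domain` is hypothetical, and the column names `Model` and `Domain` are assumptions about the CSV layout that should be checked against `models_df.columns`. It runs on a small made-up frame so that it works without network access.

```python
import pandas as pd


def filter_by_domain(df: pd.DataFrame, domain: str, column: str = "Domain") -> pd.DataFrame:
    """Return the rows whose `column` mentions `domain` (case-insensitive).

    `column` defaults to "Domain", which is an assumed column name.
    """
    if column not in df.columns:
        raise KeyError(f"column {column!r} not found; available: {list(df.columns)}")
    # na=False treats missing values as non-matches; regex=False does a plain substring test.
    mask = df[column].astype("string").str.contains(domain, case=False, na=False, regex=False)
    return df[mask].reset_index(drop=True)


# Tiny in-memory stand-in for the downloaded CSV (made-up rows, not real data).
sample_df = pd.DataFrame(
    {
        "Model": ["model-a", "model-b", "model-c"],
        "Domain": ["Biology", "Language", None],
    }
)
biology_df = filter_by_domain(sample_df, "biology")
```

With the real data, the equivalent call would be `filter_by_domain(models_df, "Biology")`, provided the downloaded file has a column with that name. If it does not, the function raises a `KeyError` that lists the columns that are present.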