Epoch AI’s Biology AI Models Dataset is a collection of machine learning models trained on biological data, and key information about their training. This dataset is useful for research about trends in the application of artificial intelligence to biology. This documentation describes which models are contained within the dataset and its records (including data fields and definitions). The dataset is available on our website as a visualization, and is available for download as a daily-updated CSV file.
If you would like to ask any questions about the database, or suggest a model that should be added, feel free to contact us at data@epoch.ai. If this dataset is useful for you, please cite it. To request access to data about biological model safeguards, please contact safeguards@epoch.ai.
Epoch’s data is free to use, distribute, and reproduce provided the source and authors are credited under the Creative Commons Attribution license.