An entry in this database contains the details of a GPU cluster at a point in time, typically the first date it became operational.
Many GPU clusters are upgraded or expanded over time with newer hardware. In these cases, we create a new record, following the procedure below, and reflecting the date when the upgrade was completed.
If an existing GPU cluster is upgraded in a way that substantially changes the cluster, we count this as a new GPU cluster and create a new entry. We do this when one of the criteria apply:
We mark that the later cluster builds on the former by linking them in the “Builds Upon” and “Superseded by” fields. (See the table of fields below for further details.)
We anonymized the data of Chinese systems by concealing names and rounding values of numerical fields to one significant figure. We do this to protect our public data sources and reduce the risk of owners redacting relevant information. We took this step in response to reports about reduced Chinese openness triggered by increased coverage in American think tank reports. We may grant some trusted researchers access to the full dataset upon request. Inquiries should be directed to data@epoch.ai.
| Column | Type | Definition | Example | Coverage |
|---|---|---|---|---|
Name | Text | The owner’s name, followed by the cluster name used in the most official announcement. | Oak Ridge NL Frontier | 100%
786 out of 786 records |
Status | Categorical (single select) | Existing: The supercomputer (as described in this entry) is believed to be operational (>80% of the GPUs available for training) | Existing | 100%
786 out of 786 records |
Certainty | Categorical (single select) | How likely does (or will) this cluster exist as roughly as specified? | Confirmed | 100%
786 out of 786 records |
Single cluster? | Categorical (single select) | Yes: The source clearly implies a single supercomputers deployed on a continuous campus without significant physical separation, connected by high-bandwidth networking fabric, forming a single system. There is no evidence otherwise | Yes | 99%
781 out of 786 records |
Chip type (primary) | Categorical (single select) | The primary AI chip used in the supercomputer. This links to a specific chip in the Epoch ML Hardware Database. | AMD Radeon Instinct MI250X | 59%
461 out of 786 records |
Chip quantity (primary) | Numerical | Total number of the primary AI chips in the supercomputer. | 37,632 | 81%
637 out of 786 records |
Hardware note | Text | Note to describe the all the AI chips in the supercomputer. | AMD MI250X | 61%
480 out of 786 records |
Max OP/s (log) | Numerical | The base 10 logrithm of the maximum theorhetical operations per second the superocmputer can achieve (out of 32, 16, or 8 bit performance) | 19.158756074028645 | 89%
703 out of 786 records |
16-bit OP/s (log) | Numerical | Base 10 logrithm of the maximum theorhetical performance of the supercomputer in 16-bit FLOP/s | 19.158756074028645 | 87%
682 out of 786 records |
H100 equivalents | Numerical | Divides the supercomputer’s Max OP/s field (in 32, 16, or 8 bit) by an H100’s FP8 FLOP/s. Note, H100 equivalents isn’t a well defined measurement, and this number should be treated more as a “sense” instead of a precise measurement. It works fairly well for comparisons among supercomputers since ~2021, but makes less sense for older supercomputers. | 7,283 | 89%
703 out of 786 records |
First Operational Date | Date | When the supercomputer was first fully operational (ie, you could run a workload on at least 80% of the cluster). This will often be an approximation to the nearest month or so, since the exact data a supercomputer became operational is rarely announced. By default, we are conservative with this date, and set it to the first date that we have confirmation that it was operational, which will often be a few months after it was actually first operational. | 2022-05-30 | 77%
608 out of 786 records |
Country | Categorical (single select) | Country in which the supercomputer is physically located. | United States of America | 89%
701 out of 786 records |
Owner | Categorical (multiple select) | The entity that owns the AI supercomputer itself (the hardware within the datacenter). This can include several entity if it is a joint venture. This can sometimes differ from the entity that owns the datacenter itself, or the entity that has a long term arrangement to rent the supercomputer. | US Department of Energy | 64%
502 out of 786 records |
Sector | Categorical (single select) | Private: Owned by a commercial entity. | Public | 98%
768 out of 786 records |
Power Capacity (MW) | Numerical | The peak power capacity of the system, in megawatts. Will be the reported power capacity if available, and the calculated power capacity if not. | 40 | 81%
636 out of 786 records |
Hardware Cost | Numerical | The cost to acquire the hardware for the system in 2025 US dollars. Includes the cost for the GPUs, CPUs, networking, etc, but not the datacenter itself. Will be the reported hardware cost if it exists, and the calculated hardware cost otherwise. | $620,445,966.19 | 68%
537 out of 786 records |
Energy Efficiency | Numerical | Log of the 16-bit FLOP/s per watt. The numerator is the theorhetical max 16-bit FLOP/s of the system, and the denomenator is the peak power capacity of the system. | 360326400000 | 77%
604 out of 786 records |
Rank when first operational | Numerical | This supercomputer’s rank in the world by performance (FLOP/s, maximum of 32, 16, or 8 bit precision) the day it first became operational | 2 | 69%
540 out of 786 records |
Location | Text | Specific location where the supercomputer is physicaly located. | Oak Ridge National Laboratory 5200, 1 Bethel Valley Rd, Oak Ridge, TN 37830 | 77%
607 out of 786 records |
Users | Categorical (multiple select) | Any significant entities known to use the supercomputer. “Cloud” refers to the supercomputer being offered publicly via the cloud. Will often just be the owner. | US Government, Academia, Industry | 60%
468 out of 786 records |
Quote | Text | A quote from the source specifying the number of and type of chips and/or the reported or manual FLOP/s number if applicable. | The system has 74 Olympus rack HPE cabinets, each with 128 AMD compute nodes, and a total of 9,408 AMD compute nodes… Each Frontier compute node consists of [1x] 64-core AMD “Optimized 3rd Gen EPYC” CPU (with 2 hardware threads per physical core) with access to 512 GB of DDR4 memory. Each node also contains [4x] AMD MI250X | 62%
484 out of 786 records |
Note | Text | Notes that give additional context or helpful information about the supercomputer. | (9408 nodes)x(4 GPUs/node) = 37,632 GPUs | 27%
213 out of 786 records |
First Operational Date Note | Text | When the supercomputer was first fully operational (ie, you could run a workload on at least 80% of the cluster). This will often be an approximation to the nearest month or so, since the exact data a supercomputer became operational is rarely announced. By default, we are conservative with this date, and set it to the first date that we have confirmation that it was operational, which will often be a few months after it was actually first operational. | 2022-05-30 | 67%
528 out of 786 records |
Certainty Note | Text | A briefly explanation of why the given certainty level was selected | Details released on official government website | 16%
123 out of 786 records |
Builds Upon | Categorical (single select) | If a supercomputer was built in multiple phases, this links to the supercomputer corresponding to the preceding phase. | 7%
57 out of 786 records | |
Superseded by | Categorical (single select) | If a supercomputer has a later phase (eg, was upgraded later in time), and that phase is already operational, this links to the supercomputer entry corresponding to that phase | 4%
31 out of 786 records | |
Possible duplicate | Boolean | True if we think there is a >20% chance that this supercomputer is a duplicate of a supercomputer that already exists in this database. In circumstances where we think two entries might correspond to the same supercomputer, we will mark this field False for the entry with which we have more information, and True for the entry with which we have less information (this allows the user to filter out all entries with “Possible Duplicate”=True for analysis) | 12%
94 out of 786 records | |
Possible Duplicate Of | Categorical (multiple select) | If “Possible duplicate” is marked True, this will link to the entry that we think this supercomputer is a duplicate of | 7%
55 out of 786 records | |
Chip type (secondary) | Categorical (single select) | The secondary AI chip used in the supercomputer, if there was any. This links to a specific chip in the Epoch ML Hardware Database. | 4%
29 out of 786 records | |
Chip quantity (secondary) | Numerical | Total number of the secondary AI chips in the supercomputer, if there are any. | NaN | 4%
31 out of 786 records |
Total number of AI chips | Numerical | Corresponds to the number of AI chips in the supercomputer. Maxium of {Primary AI chip quantity + Secondary AI chip quantity; Manual total chips number; number of chips calculated from FLOP/s} | 37,632 | 82%
647 out of 786 records |
GPU supplier (primary) | Categorical (single select) | The GPU supplier for the Primary AI Chip type | AMD | 100%
786 out of 786 records |
GPU supplier (secondary) | Categorical (single select) | The GPU supplier for the Secondary AI Chip type (if applicable) | 36%
286 out of 786 records | |
Exclude | Boolean | True if the entry should be excluded from analyses. This could be becase it falls below the inclusion threshold, or there is another reason for excluding it. False otherwise. | 0%
0 out of 786 records | |
Include in Standard Analysis | Boolean | True if this entry qualified to be used in the analyses we used for the original paper. That is, (Status=“Existing” or “Decomissioned”) AND (Exclude=False) AND (Certainty = “Confirmed” or “Likely”) AND (Possible Duplicate=False) AND (Single Cluster?=“Yes”) | True | 61%
482 out of 786 records |
Max OP/s | Numerical | The maximum theorhetical operations per second the superocmputer can achieve (out of 32, 16, or 8 bit performance) | 1.4e+19 | 89%
703 out of 786 records |
8-bit OP/s | Numerical | The maximum theorhetical operations per second the superocmputer can achieve in 8-bit numerical precision | 1.4e+19 | 51%
397 out of 786 records |
16-bit OP/s | Numerical | The maximum theorhetical operations per second the superocmputer can achieve in 16-bit numerical precision | 1.4e+19 | 87%
682 out of 786 records |
32-bit OP/s | Numerical | The maximum theorhetical operations per second the superocmputer can achieve in 32-bit numerical precision | 3.6e+18 | 71%
558 out of 786 records |
Calculated Power Capacity (MW) | Numerical | The calculated power capacity of the supercomputer in megawatts based on the number and type of chips. See paper for full methodology | 40.066790399999995 | 77%
607 out of 786 records |
Reported power capacity (MW) | Text | Reported power capacity of the supercomputer in megawatts | 40 | 17%
132 out of 786 records |
Calculated Cost | Numerical | Calculates the cost (in 2025 US dollars) based on number and type of chips. See paper for full methodology | $967,488,702.64 | 68%
536 out of 786 records |
Reported Cost | Numerical | Reported total cost of the supercomputer in US dollars at the time. | $600,000,000 | 4%
33 out of 786 records |
Reported Cost (Inflation adjusted) | Numerical | Reported total cost of the supercomputer, adjusted to 2025 US dollars | $620,445,966.19 | 4%
30 out of 786 records |
Cost Quote | Text | Quote from the source saying how much the supercomputer cost. Might refer to a currency besides dollars, or not yet be inflation adjusted | $600M | 4%
34 out of 786 records |
Largest existing cluster when first operational | Text | The name of the largest known supercomputer existing on the first operaitonal date of this supercomputer | Microsoft GPT-4 cluster | 69%
540 out of 786 records |
% of largest cluster when first operational | Numerical | The maximum theorhetical FLOP/s (32, 16, or 8 bit) for the this supercomputer, divided by the maximum theorhetical FLOP/s for the largest known supercomputer existing at the time. | 0.9239138461538378 | 69%
540 out of 786 records |
Source 1 (through Source n) | URL | URL for source on the AI supercomputer. Generally saved as a Wayback Machine link to preserve the link | 66%
522 out of 786 records |
Have a question? Noticed something wrong? Let us know.
Our database of over 500 GPU clusters and supercomputers tracks large hardware facilities, including those used for AI training and inference.