Normax Biomed Ltd (Normax) is based in Cork, Ireland, and London, England. Normax is in the business of mRNA vaccine Research, Development and Manufacturing. Normax has secured a €300,000,000 capital commitment from a $3.4Bn cornerstone institutional investor for development of mRNA Vaccines and Vax Factory Manufacturing for Transformative Social Impact on Infectious Disease and Pandemic Preparedness. Normax plans to drive down the cost of mRNA Vaccines to save more lives and to deliver sustainable returns for impact investors. Normax plans to deliver safe and effective mRNA vaccines at large scale for about $4 per dose. Normax mRNA vaccine products in development include: (1) mRNA Vax Factory, (2) Universal Coronavirus mRNA Vaccine, (3) Tuberculosis mRNA Vaccine, (4) HIV mRNA Vaccine, (5) Malaria mRNA Vaccine, and (6) Disease-X mRNA Vaccine (e.g. within 100 days). Normax's mission is to deliver competitive financial performance with transformative social impact. NOT AN OFFER TO INVEST.

NIH REPORT. National Human Genome Research Institute. Cost per Raw Megabase of DNA Sequence.

Wetterstrand KA. DNA Sequencing Costs: Data from the NHGRI Genome Sequencing Program (GSP)

DNA Sequencing Costs: Data

For many years, the National Human Genome Research Institute (NHGRI) has tracked the costs associated with DNA sequencing performed at the sequencing centers funded by the Institute. This information has served as an important benchmark for assessing improvements in DNA sequencing technologies and for establishing the DNA sequencing capacity of the NHGRI Genome Sequencing Program. Here, NHGRI provides an analysis of these data, which gives one view of the remarkable improvements in DNA sequencing technologies and data-production pipelines in recent years.

Overview

The cost-accounting data presented here are summarized relative to two metrics: (1) “Cost per Megabase of DNA Sequence” – the cost of determining one megabase (Mb; a million bases) of DNA sequence of a specified quality [see below]; (2) “Cost per Genome” – the cost of sequencing a human-sized genome. For each, a graph is provided showing the data since 2001; in addition, the actual numbers reflected by the graphs are provided in a summary table.

NHGRI welcomes people to download these graphs and use them in their presentations and teaching materials. NHGRI plans to update these data on a regular basis. You can view the data in Excel by downloading Sequencing Costs 2021.

Graph: Sequencing Cost Per Megabase (2021)

Graph: Sequencing Cost Per Genome (2021)

To illustrate the nature of the reductions in DNA sequencing costs, each graph also shows hypothetical data reflecting Moore’s Law, which describes a long-term trend in the computer hardware industry that involves the doubling of ‘compute power’ every two years (See: Moore’s Law [wikipedia.org]). Technology improvements that ‘keep up’ with Moore’s Law are widely regarded to be doing exceedingly well, making it useful for comparison.
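The hypothetical Moore's Law line can be sketched as follows. This is a minimal illustration, assuming that a doubling of compute power every two years is modelled as a halving of cost over the same period; the function name and the starting cost are hypothetical and are not taken from the NHGRI data.

```python
def moores_law_cost(initial_cost: float, years_elapsed: float,
                    doubling_period_years: float = 2.0) -> float:
    """Hypothetical cost after `years_elapsed` years if cost halves
    once per doubling period (an assumed reading of Moore's Law)."""
    return initial_cost * 0.5 ** (years_elapsed / doubling_period_years)


if __name__ == "__main__":
    start_cost = 1000.0  # hypothetical starting cost, not an NHGRI figure
    for year in (0, 2, 4, 10, 20):
        print(year, moores_law_cost(start_cost, year))
```

On a logarithmic Y axis this curve is a straight line, so any technology whose cost falls faster than it does appears as a line that bends below it.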

In both graphs, note: (1) the use of a logarithmic scale on the Y axis; and (2) the sudden and profound out-pacing of Moore’s Law beginning in January 2008. The latter represents the time when the sequencing centers transitioned from Sanger-based (dideoxy chain termination sequencing) to ‘second generation’ (or ‘next-generation’) DNA sequencing technologies. Additional details about these graphs are provided below.

These data, however, do not capture all of the costs associated with the NHGRI Large-Scale Genome Sequencing Program. The sequencing centers perform a number of additional activities whose costs are not appropriate to include when calculating costs for production-oriented DNA sequencing. In other words, NHGRI makes a distinction between ‘production’ activities and ‘non-production’ activities. Production activities are essential to the routine generation of large amounts of quality DNA sequence data that are made available in public databases; the costs associated with production DNA sequencing are summarized here and depicted on the two graphs. Additional information about the other activities performed by the sequencing centers is provided below.

Key Considerations

Cost Categories

The expenditures included in each category were established based on discussions between NHGRI staff and sequencing center personnel.

For the two graphs (“Cost per Megabase of DNA Sequence” and “Cost per Genome”), the following ‘production’ costs are accounted for:

Labor, administration, management, utilities, reagents, and consumables
Sequencing instruments and other large equipment (amortized over three years)
Informatics activities directly related to sequence production (e.g., laboratory information management systems and initial data processing)
Submission of data to a public database
Indirect Costs as they relate to the above items
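
As a rough illustration of how such categories could be combined into a cost per megabase, the sketch below adds annual operating costs to equipment costs spread over three years and divides by annual output. It assumes straight-line amortization (the source says only "amortized over three years"); the function names, cost labels, and figures are hypothetical and do not reproduce NHGRI's actual accounting.

```python
AMORTIZATION_YEARS = 3  # from the text: large equipment amortized over three years


def annual_equipment_charge(purchase_price: float) -> float:
    """Assumed straight-line amortization: equal charge in each year."""
    return purchase_price / AMORTIZATION_YEARS


def production_cost_per_mb(annual_operating_costs: dict[str, float],
                           equipment_purchase_prices: list[float],
                           megabases_per_year: float) -> float:
    """Total annual production cost divided by megabases produced that year."""
    operating = sum(annual_operating_costs.values())
    equipment = sum(annual_equipment_charge(p) for p in equipment_purchase_prices)
    return (operating + equipment) / megabases_per_year


if __name__ == "__main__":
    # Hypothetical figures, for illustration only.
    costs = {"labor": 600.0, "reagents": 300.0}
    print(production_cost_per_mb(costs, [300.0], 100.0))
```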

In the case of costs covered by significant subsidies to a sequencing center (e.g., a grantee institution providing funds for purchasing large equipment), NHGRI has attempted to appropriately account for such costs in these analyses.

The costs associated with the following ‘non-production’ activities are not reflected in the two graphs:

Quality assessment/control for sequencing projects
Technology development to improve sequencing pipelines
Development of bioinformatics/computational tools to improve sequencing pipelines or to improve downstream sequence analysis
Management of individual sequencing projects
Informatics equipment
Data analysis downstream of initial data processing (e.g., sequence assembly, sequence alignments, identifying variants, and interpretation of results)

DNA Sequencing Technologies

In both graphs, the data from 2001 through October 2007 represent the costs of generating DNA sequence using Sanger-based chemistries and capillary-based instruments (‘first generation’ sequencing platforms). Beginning in January 2008, the data represent the costs of generating DNA sequence using ‘second-generation’ (or ‘next-generation’) sequencing platforms. The change in instruments represents the rapid evolution of DNA sequencing technologies that has occurred in recent years.

Quality

For the Sanger-based sequence data, the cost accounting reflects the generation of bases with a minimum quality score of Phred20 (or Q20), which represents an error probability of 1% and is an accepted community standard for a high-quality base. For sequence data generated with second-generation sequencing platforms, there is not yet a single accepted measure of accuracy; each manufacturer provides quality scores that are, at this time, accepted by the NHGRI sequencing centers as equivalent to or greater than Q20.
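
The link between Q20 and a 1% error probability follows from the standard Phred definition, Q = -10 log10(P). A minimal sketch of the conversion in both directions (the function names are illustrative):

```python
import math


def phred_to_error_probability(quality: float) -> float:
    """Per-base error probability implied by a Phred quality score."""
    return 10 ** (-quality / 10)


def error_probability_to_phred(probability: float) -> float:
    """Phred quality score implied by a per-base error probability."""
    return -10 * math.log10(probability)


if __name__ == "__main__":
    print(phred_to_error_probability(20))    # error probability for Q20
    print(error_probability_to_phred(0.01))  # quality score for a 1% error rate
```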

In the “Cost per Megabase of DNA Sequence” graph, the data reflect the cost of generating raw, unassembled sequence data; no adjustment was made for data generated using different instruments despite significant differences in the sequence read lengths. In contrast, the “Cost per Genome” graph does take these differences into account since sequence read length influences the ability to generate an assembled genome sequence.

Genome Coverage

The “Cost per Genome” graph was generated using the same underlying data as that used to generate the “Cost per Megabase of DNA Sequence” graph; the former thus reflects an estimate of the cost of sequencing a human-sized genome rather than the actual costs for specific genome-sequencing projects.

To calculate the cost for sequencing a genome, one needs to know the size of that genome and the required ‘sequence coverage’ (i.e., ‘sequence redundancy’) to generate a high-quality assembly of the genome given the specific sequencing platform being used. For generating the “Cost per Genome” graph, the assumed genome size was 3,000 Mb (i.e., the size of a human genome). The assumed sequence coverage needed differed among sequencing platforms, depending on the average sequence read length for that platform.

The following ‘sequence coverage’ values were used in calculating the cost per genome:

Sanger-based sequencing (average read length=500-600 bases): 6-fold coverage
454 sequencing (average read length=300-400 bases): 10-fold coverage
Illumina and SOLiD sequencing (average read length=75-150 bases): 30-fold coverage
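
Combining the assumed genome size with these coverage values, the per-genome estimate can be sketched as cost per megabase × genome size × coverage. This is an assumed reading of the description above rather than NHGRI's published calculation; the platform keys and the input cost in the example are hypothetical.

```python
GENOME_SIZE_MB = 3000  # assumed human genome size from the text, in megabases

# Fold coverage per platform, as listed in the text.
COVERAGE_BY_PLATFORM = {
    "sanger": 6,
    "454": 10,
    "illumina_solid": 30,
}


def cost_per_genome(cost_per_mb: float, platform: str) -> float:
    """Estimated cost of sequencing a human-sized genome at the
    coverage assumed for the given platform."""
    coverage = COVERAGE_BY_PLATFORM[platform]
    return cost_per_mb * GENOME_SIZE_MB * coverage


if __name__ == "__main__":
    # Hypothetical cost per megabase, for illustration only.
    for platform in COVERAGE_BY_PLATFORM:
        print(platform, cost_per_genome(0.5, platform))
```

Because coverage differs fivefold between Sanger and short-read platforms, the same cost per megabase yields quite different per-genome estimates, which is why the two graphs do not simply track each other by a constant factor.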

For data since January 2008 (representing data generated using ‘second-generation’ sequencing platforms), the “Cost per Genome” graph reflects projects involving the ‘re-sequencing’ of the human genome, where a reference human genome sequence is available to serve as a backbone for downstream data analyses. The required ‘sequence coverage’ would be greater for sequencing genomes for which no reference genome sequence is available.

References

Mardis E. A decade’s perspective on DNA sequencing technology. Nature, 470: 198-203. 2011.
Metzker M. Sequencing technologies – the next generation. Nature Reviews Genetics, 11: 31-46. 2010.
Stein L. The case for cloud computing in genome informatics. Genome Biology, 11: 207-213. 2010.
Human genome at ten: the sequence explosion. Nature, 464: 670-671. 2010.
NHGRI Genome Sequencing Program

How to Cite this Web Page:
Wetterstrand KA. DNA Sequencing Costs: Data from the NHGRI Genome Sequencing Program (GSP). Available at: www.genome.gov/sequencingcostsdata. Accessed [date of access].

This website includes “forward-looking statements” within the meaning of the “safe harbor” provisions of the United States Private Securities Litigation Reform Act of 1995. Forward-looking statements may be identified by the use of words such as “forecast,” “intend,” “seek,” “target,” “anticipate,” “believe,” “will,” “expect,” “estimate,” “plan,” “outlook,” and “project” and other similar expressions that predict or indicate future events or trends or that are not statements of historical matters. Such forward-looking statements include statements about our beliefs and expectations and the estimated financial information and other projections contained herein. Such forward-looking statements with respect to revenues, earnings, performance, strategies, prospects and other aspects of the businesses of Normax Biomed Ltd. are based on current expectations that are subject to risks and uncertainties. A number of factors could cause actual results or outcomes to differ materially from those expressed or implied by such forward-looking statements. Please refer to the final prospectus of Normax Biomed Limited under “Risk Factors” therein, and other documents filed or to be filed with the London Stock Exchange and the Swiss Stock Exchange (SIX) by Normax Biomed Ltd. You are cautioned not to place undue reliance upon any forward-looking statements, which speak only as of the date made. Normax Biomed Ltd. undertakes no commitment to update or revise the forward-looking statements, whether as a result of new information, future events or otherwise, except as required by law.

The information on this website shall not constitute a solicitation of a proxy, consent or authorization with respect to any securities or in respect of the proposed transaction. The information on this website shall also not constitute an offer to sell or the solicitation of an offer to buy any securities, nor shall there be any sale of securities in any states or jurisdictions in which such offer, solicitation or sale would be unlawful prior to registration or qualification under the securities laws of any such jurisdiction. No offering of securities shall be made except by means of a prospectus meeting the listing requirements of the London Stock Exchange and the Swiss Stock Exchange (SIX).

This website is directed only at, and provides information about products and services only available to, those who are Professional Clients or Eligible Counterparties as defined by the Financial Conduct Authority. The definitions can be found on the FCA website at www.fca.org.uk. This website is not intended to be accessed by any persons or entities domiciled in any jurisdiction where being treated as the types of clients stated would be contrary to local law.
