The Impact of a Customized MALDI-TOF Library for Environmental

Transcription

The Impact of a Customized MALDI-TOF Library for Environmental
Technical Sheet
The Impact of a Customized
MALDI-TOF Library for Environmental
Microbial Identifications
Executive Summary
While a number of microbial identification systems are available to the pharmaceutical market for compendial testing, these platforms tend
to underperform, because they do not support the broad range of microorganisms encountered in environmental monitoring programs.
The reference libraries are often biased toward clinical, rather than environmental, organisms. A prime example is Bruker Daltonic’s
MALDI-TOF Biotyper® microbial identification platform. Here, we illustrate a systematic approach for the inclusion of new library entries,
with the priority on adding the most frequently encountered environmental species first. The additional entries created using this approach
increased the percentage of environmental isolates identified using the Biotyper platform from 73.1% to 83.7% (n=13,786 and 15,095).
While low cost and rapid turnaround time remain strong advantages for MALDI-TOF microbial identification platforms, our data demonstrate
that building a reference database with environmental organisms in mind is essential for its suitability to nonclinical applications.
Introduction
Methods
The Accugenix® brand supports the pharmaceutical, medical
device, nutraceutical, personal care and cosmetics industries,
helping to maintain environmental monitoring programs and
product failure investigations or other compendial testing. Having
identified more than a million samples over the past decade,
our services are well suited to examine the efficacy of microbial
identification platforms for environmental monitoring applications,
often using in-house 16S gene sequencing to compare and
evaluate systems. Since we first offered our AccuPRO-ID®
microbial identifications using Bruker Daltonic’s MALDI-TOF
Biotyper platform in 2010, we have processed over 70,000
samples on the system and continue to optimize the platform to
better accommodate environmental isolates. This study assesses
the suitability of the Biotyper platform for identifying environmental
isolates, demonstrates our systematic approach to broadening
the reference library and shows the impact of adding new library
entries on reportability for these industrial sectors.
All data presented in this study were generated from unknown
samples received for microbial identification. Samples submitted
for AccuPRO-ID® MALDI-TOF identification were processed
using a direct formic acid lysis method and/or the extract method
(per Accugenix SOPs and the manufacturer’s instructions) and
analyzed using Bruker’s MALDI-TOF and Biotyper software.
Samples with a “match factor” greater than 1.75 and separated
by >0.1 were ascribed probable species-level identifications. All
samples that failed to produce an identification using Biotyper
were submitted for AccuGENX-ID® 16S rDNA sequencing,
where sequence data were manually assembled and compared
against the validated and proprietary Accugenix 16S reference
library. Qualified data analysts determined identifications and
confidence levels. 16S-based sequence identification is used as
the reference method for this study.
North America: 1.877.274.8371 • Europe: 00 800 15 78 97 43
email: askcharlesriver@crl.com
www.criver.com
© 2013, Charles River Laboratories International, Inc.
Technical Sheet
Results
Figure 1: Diversity of Species Routinely Identified
Log (Frequency of Occurrence)
90%
95%
99%
Figure 3: Systematic Approach for the Inclusion of New
MALDI-TOF Library Entries
100%
100,000
Samples
10,000
1,000
Spotting
100
10
1
1
500
1000
1500
2000
MALDI-TOF MS
Species Count
The number of unique species identified by 16S sequencing
(x-axis) versus the frequency at which these species were
encountered (y-axis) (n=233,366). Vertical lines (dark blue)
designate the number of species that make up the top 90%, 95%,
99% and 100% of species identified. These results indicate that a
wide range of species are isolated from environmental sources.
Figure 2: Species Diversity of MALDI-TOF
Fall-Through Samples
Log (Frequency of Occurrence)
90%
95%
Spectra Acquisition /
Analysis
Library Comparison
99% 100%
1,000
100
Species-Level
Identification
10
1
1
500
1000
16S DNA
Sequencing
1500
Species Count
The number of unique species (x-axis) versus the number of times
these species failed to generate an identification (y-axis) using the
MALDI-TOF platform (n=12,199). Vertical lines (dark blue) indicate
the number of species that makeup the top 90%, 95%, 99% and
100% of fall-through species. These results indicate that when
the Biotyper system is used for environmental monitoring, the fallthrough samples represent an extensive number of species.
No Reportable
Identification
Report
Report & Identification
of MALDI-TOF
“Fall-Through” Samples
All samples that fail to generate an identification using the MALDITOF platform are submitted for 16S sequence identification. By
identifying all MALDI-TOF “fall-throughs,” we have the unique
opportunity to improve the MALDI-TOF Biotyper reference library.
Endotoxin and Microbial Detection
The Impact of a Customized MALDI-TOF
Library for Environmental Microbial Identifications
Technical Sheet
100
90
80
70
60
50
Missing
Present
40
30
20
10
0
Average
Figure 6: New Entries Target the Most Frequently Occurring
Species
Top 95% Encountered
Species (Percent)
Species (Percent)
Figure 4: Proportion of Fall-Through Species Present
and Missing from the Reference Library
100
90
80
70
60
50
40
30
20
10
0
Average - Weighted for
Frequency of Fall Through
The percent of MALDI-TOF fall-through species (y-axis)
represented by at least one MALDI-TOF library entry
(present=dark pink) or are not yet included in the library
(absent=light pink) (n=1,242). This comparison was completed
by disregarding the fall-through occurrence of each species (left),
as well as weighting for fall-through occurrence (right). These
data indicate that the inclusion of new species, and to a lesser
extent redundant species-level entries, is needed to improve the
reference library for environmental monitoring applications.
66.0%
96.6%
Biotyper
v3.3.1.2
Accugenix
v13.03
Missing
Present
Of the 559 species constituting the top 95% of species
encountered, the percent of species that are present (darker pink)
or absent (lighter pink) is compared against the manufacturer’s
library(left) and the Accugenix library (right). These data indicate
that library entries targeting the most frequently encountered
species seen are developed first, effectively tailoring the library
toward the environmental microorganisms important to the
industries we serve.
Figure 7: Source of Library Entries Leading to a
MALDI-TOF Identification
Figure 5: Number of Library Entries and Frequency
of Additions
Number of Library Entries
6000
5000
Biotyper
(63.1%)
4000
3000
1000
0
13
20
12
20
11
20
10
20
Year
Matching
Biotyper
(47.6%)
Discrepant
Biotyper
(18.1%)
Total Entries
Biotyper Entries
Accugenix Entries
2000
Accugenix
(36.9%)
No Other
Match
(34.3%)
The number of MALDI-TOF library entries (y-axis) created by the
manufacturer (orange), Accugenix (light blue) and combined
(dark blue), plotted versus time (x-axis). While the Accugenix
MALDI-TOF library currently only comprises ~15% of the entire
database, it is updated approximately 2-3 times more frequently
than the manufacturer’s updates (see entry markers) to include
microbes from environmental origins.
Of all samples that resulted in an identification (n=13,786), the
source of the top-matching library entry (Biotyper in dark pink,
Accugenix in light pink) was evaluated (left-hand pie chart).
Accugenix library entries were the top-match on 36.9% of all
reported identifications. Given that Accugenix entries make up
only ~15% of the entire library (Fig. 5) but are the top-match
on nearly 37% of all identifications, Accugenix entries are the
top match more than twice as often as Biotyper entries. Of all
samples reported with an Accugenix entry as the top match (righthand pie chart), Biotyper’s library would have reported the same
identification in 47.6% of the cases, but a discrepant identification
or no reportable identification in 18.1% and 34.3% of the cases,
respectively. Together, these data indicate that the Accugenix
library increases identification reportability and accuracy for
samples from environmental origins.
Endotoxin and Microbial Detection
The Impact of a Customized MALDI-TOF
Library for Environmental Microbial Identifications
Technical Sheet
Samples Tested (Percent)
Figure 8: Impact of a Customized Database on
Reportable Identifications
100
90
80
70
60
50
40
30
20
10
0
26.9%
16.3%
No Reportable
Identification
Reportable
Identification
Biotyper
v3.3.1.2
Accugenix
v13.03
Of all recent samples submitted for identification using the MALDITOF platform (n=15,095), the percent of samples resulting in no
reportable identification (light pink) and a reportable identification
(dark pink) was determined when using the manufacturer’s
database (left) and Accugenix’s customized database (right).
When using Accugenix’s library entries for environmental microbial
identifications, there is a 10.6% increase in reportable samples,
which translates into a 39.4% decrease in the non-reportable rate.
Discussion
While the number of microbial species frequently encountered in
the environmental arena is rather daunting (Fig. 1), inclusion of
these species in any microbial identification platform’s reference
library is critical to increasing reporting and/or improving a
platform’s accuracy for non-clinical applications. Using the
systematic approach presented here (Fig. 3), the species that
most frequently fail to generate an identification using Biotyper
are pinpointed by 16S-sequencing all fall-through samples
(Fig. 4), enabling the addition of these fall-through species to
the Accugenix reference library (Fig. 6). To date, these library
entries generate identifications more than twice as frequently
as the entries provided by the manufacturer, confirming that the
most frequent fall-throughs are added to the Accugenix reference
library first. On a similar note, these additional library entries
generate nearly 37% of all identifications, increase the number of
identifications by more than 10% and effectively reduce the nonreportable rate by 39.4% (Fig. 8). With future library releases, the
percentage of samples generating an identification will continue
to rise. That being said, the current percentage of fall-through
samples is 16.3%, which indicates that backing the MALDITOF platform with Accugenix 16S sequencing is still required
to reliably generate identifications for samples originating from
environmental sources.
Glossary of Terms
AccuGENX-ID® – genotypic identification method utilizing
comparative DNA sequencing of the 16S gene of bacteria for
identification with the highest accuracy and reportable rate
AccuPRO-ID® – polyphasic approach utilizing MALDI-TOF
and AccuGENX-ID 16S rDNA sequencing, assuring the highest
accuracy and reportable rate
Environmental monitoring – program designed to demonstrate
compliance and control of a manufacturing environment,
providing a baseline microbial profile of a facility and allowing for
tracking and trending to source contaminations or excursions
Genotypic – identification method using DNA sequence data or
fragment analysis
Library/Database – validated data set containing known
microorganism entries used to compare against unknown
microbial data for identifications
MALDI-TOF – matrix-assisted laser desorption/ionization timeof-flight mass spectrometry method which produces a ribosomal
protein spectrum that can be used for identification
Phenotypic – identification method using biochemical or
physiological data
Polyphasic – method consisting of two or more phases
Proteotypic – identification method that uses a ribosomal protein
spectral fingerprint
rDNA – DNA that encodes for the ribosomal RNA subunits in
an organism; after the rRNA sequence is transcribed from the
rDNA, it assembles into the ribosomal complex, e.g., the
bacterial 16S rRNA
Ribosome – complex molecule responsible for protein synthesis
consisting of multiple RNAs and proteins assembled into the
structure
North America: 1.877.274.8371 • Europe: 00 800 15 78 97 43
email: askcharlesriver@crl.com
www.criver.com
© 2013, Charles River Laboratories International, Inc.