Discrimination of Coffee Bean Species Based on Aroma Compounds

Applications | 2026 | ShimadzuInstrumentation

GC/MSD, GC/MS/MS, GC/QQQ, Software

Industries

Food & Agriculture

Manufacturer

Shimadzu

Go to the library View PDF

Summary

Significance of the topic

Accurate and rapid discrimination of coffee species (primarily Coffea arabica vs. Coffea canephora — Robusta) matters for quality control, detection of adulteration, flavor profiling, and supply-chain integrity in coffee production and trading. Aroma compound profiling by GC-MS/MS combined with targeted database matching and chemometric modeling offers a practical alternative to more laborious approaches (e.g., solvent extraction and NMR markers), delivering faster workflows, higher sensitivity, and direct linkage between chemical markers and sensory descriptors.

Objectives and overview of the study

This application study evaluated whether headspace solid-phase microextraction (HS-SPME) coupled to triple-quadrupole GC-MS/MS can reliably discriminate Arabica and Robusta roasted coffee beans. Specific aims were to: (1) obtain comprehensive volatile profiles using a Smart Aroma Database for identification and MRM quantification; (2) assess separation and characteristic compounds using multivariate statistics (PCA, PLS-DA, clustering); and (3) build and validate a classification model (SVM) to assign unknown samples to species.

Methodology

Sample preparation and acquisition

- Commercial roasted coffee beans from four Arabica brands and two Robusta brands were milled; 1 g aliquots were sealed in screw vials and analyzed in triplicate.
- Volatiles were concentrated by HS-SPME and introduced directly to the GC-MS/MS with no solvent extraction or concentration steps.

Chromatography and mass spectrometry conditions (concise)

- Instrumentation: GCMS-TQ8040 NX triple-quadrupole MS with AOC-6000 autosampler.
- Column: SH-I-5Sil MS (30 m × 0.25 mm I.D., 0.25 µm).
- Injection: splitless, vaporizer 250 °C.
- Oven program: 50 °C hold 0–5 min, ramp 10 °C/min to 250 °C (final hold to 35 min total).
- Carrier: helium. Ionization: EI; interface 250 °C; ion source 200 °C.
- Acquisition modes: full-scan (m/z 35–400) for exploratory alignment and targeted MRM using Smart Aroma Database for higher sensitivity.

Data processing and chemometrics

- Peak alignment and exploratory analysis: Signpost MS (alignment assigned ~210 compounds from scan data).
- Targeted MRM identifications and quantification used Smart Aroma Database (contains ~500 aroma compounds with MRM transitions); 178 database compounds were identified in MRM mode and 175 compounds without missing values were used for multivariate modeling.
- Multivariate workflows in eMSTAT Solution: unsupervised PCA, supervised PLS-DA, hierarchical clustering and dendrograms, visualization (loading plots, box plots), and supervised classification using Support Vector Machine (SVM) with leave-one-brand-out validation.

Used instrumentation

GCMS-TQ8040 NX triple quadrupole gas chromatograph–mass spectrometer
AOC-6000 multifunctional autosampler (HS-SPME automation)
SH-I-5Sil MS GC column (30 m × 0.25 mm, 0.25 µm)
Smart Aroma Database (GC-MS(/MS) with MRM conditions and sensory descriptors)
LabSolutions Insight GCMS (for chromatogram review)
Signpost MS (alignment and peak assignment)
eMSTAT Solution (statistical analysis, PCA, PLS-DA, SVM model building)

Main results and discussion

- Exploratory scan data: alignment produced ~210 features and hierarchical clustering separated Arabica and Robusta into distinct clusters, indicating species-specific volatile patterns are present and detectable with HS-SPME GC-MS.
- Targeted MRM data: 178 aroma compounds from the Smart Aroma Database were identified; using 175 complete variables, PCA and PLS-DA produced clear separation by species along PC1 and in supervised models, confirming reproducible chemical differences with improved sensitivity and selectivity in MRM mode.
- Characteristic markers: Arabica samples were relatively enriched in compounds such as 5-methylfurfural, acetoin, and furaneol acetate — compounds associated with roasted, caramel, and creamy notes. Robusta was comparatively enriched in p-vinylguaiacol, a compound linked to medicinal/clove-like off-notes and bitterness, supporting sensory distinctions between species.
- Visualization and interpretation: eMSTAT Solution box plots and loading plots enabled straightforward identification of compounds contributing most to PC1 separation; these link analytical signals to sensory descriptors from the Smart Aroma Database.
- Classification performance: an SVM discrimination model built with three Arabica brands plus two Robusta brands and tested in a leave-one-brand-out manner correctly classified each withheld Arabica brand as Arabica with scores ≥85 (several at 100), demonstrating high accuracy and practical discriminative power for sample-level assignment.

Benefits and practical applications

Discrimination of Coffee Bean Species Based on Aroma Compounds

Summary

Significance of the topic

Objectives and overview of the study

Methodology

Used instrumentation

Main results and discussion

Benefits and practical applications

Future trends and possibilities for use

Conclusion

Reference

Similar PDF

Key words

Key words

Key words

Key words