Introduction: Population-level substance use research primarily relies on self-reports, which often underestimate actual use. Hair analyses offer a more objective estimate; however, longitudinal studies examining concordance are lacking. Previous studies showed that specific psychological and behavioral characteristics are associated with a higher likelihood of underreporting substance use, but the longitudinal stability of these associations remains unclear. We compared the prevalence of illegal and non-medical prescription substance use assessed with self-reports and hair analyses and predicted underreporting across two time points. Methods: Data were drawn from a community cohort study. At the first time point, the sample with self-report and hair analysis comprised 1,002 participants (Mage = 20.6 [SD = 0.38] years, 50.2% female), of which 761 (Mage = 24.5 [SD = 0.38] years, 48.3% female) also provided hair at the second time point. We compared substance use 3-month prevalence rates assessed by self-reports and hair analyses for the most frequent substances cannabis/tetrahydrocannabinol (THC), amphetamines, Ecstasy/3,4-methylenedioxymethamphetamine (MDMA), cocaine, ketamine, codeine, and opioid painkillers. Binary logistic regressions were conducted to test behavioral and psychological predictors of underreporting. Results: Self-reported past-year prevalence rates of non-medical substance use were high, specifically for cannabis (56% prevalence rate at age 20/49% at age 24), Ecstasy (13%/14%), codeine (13%/11%), cocaine (12%/13%), and opioid painkillers (4%/11%). Comparing self-report and hair-analysis 3-month prevalence rates over time, consistent underreporting (similar underreporting rates between time points and investigation of false negatives) was observed for daily cannabis (22%/23%), Ecstasy/MDMA (41%/52%), cocaine (30%/60%), ketamine (61%/72%), and codeine use (48%/51%). Underreporting of Ecstasy/MDMA, cocaine, ketamine, and opioid painkillers significantly increased. Contrarily, weekly to daily cannabis (31%/18%), amphetamine (95%/11%), and opioid painkiller use (12%/66%) were overreported. Hair analysis-derived 3-month prevalence rates of cocaine (9%/23%) and ketamine (2%/6%) strongly increased over time, while decreasing for codeine (11%/8%). Balanced accuracies were higher for hair analysis compared to self-reports for daily cannabis, Ecstasy/MDMA, cocaine, ketamine, and codeine but lower for weekly to daily cannabis and amphetamines, while fairly similar for opioid painkillers. Accuracy metrics were largely stable for cannabis measures but partially varied over time for other substances, which was likely driven by the large changes in underreporting. False negative reports were associated across both time points, indicating an intra-individual consistency of underreporting. At both time points, delinquency and attention-deficit hyperactivity disorder symptoms were associated with an increased likelihood of accurately reporting cocaine use, while internalizing symptoms increased the likelihood of accurately reporting codeine use. Conclusion: Consistent and changeable underreporting emphasizes the importance of objective substance use assessments, specifically for studies investigating cocaine, Ecstasy/MDMA, ketamine, and codeine.

Substance use is a global concern [1], being associated with physical and mental health problems [2]. Adolescents and young adults are at increased risk of substance use and substance use disorders [3]. However, measuring substance use poses challenges. Substance use can be captured either by self-reports that are prone to reporting biases or by biological markers, such as hair analyses or urine tests, which seem to be more objective data but come with limitations, such as sensitivity.

The accuracy and reliability of self-reports and hair analyses have been investigated cross-sectionally but not yet at two time points for a community-based cohort of young adults [4‒7]. However, measuring substance use across two time points can indicate stability and changes in substance use behavior over time. Consequently, a longitudinal perspective is crucial to identifying trends and patterns, providing a better understanding of these (intra-individual) dynamics to inform prevention and intervention strategies. The present study thus reports substance use prevalence rates (12-month and 3-month) among females and males aged 24 in a cohort in Switzerland (i) to draw comparisons to prevalence rates at age 20 [8], (ii) to assess the accuracy and reliability of self-reports and hair analyses at two measurement points (ages 20 and 24), and (iii) to investigate who underreports substance use across both time points [5].

Prevalence

Substance use is particularly prevalent among young adults. Cross-sectional survey data reveal that by age 21, a significant majority of young adults in countries like New Zealand (62%) and the USA (54%) have used cannabis [9]. UN agencies have reported increased use between 2011 and 2021 not only of alcohol, tobacco, and other illegal substances (e.g., cannabis, hallucinogens, stimulants, opioids) but also of non-medical prescription medications (e.g., opioid painkillers, tranquilizers), with global substance use rising by 23% over the past decade, resulting in one in every 17 people worldwide having used a substance in 2021 [1]. However, cannabis remains the most widely (illegally) used substance globally, followed by non-medical use of prescribed medication [1].

Several data sources provide an overview of substance use in Switzerland [8, 10‒15]. These studies, particularly the z-proso project (on which the present study is based), reveal frequent substance use among young adults (e.g., 57% use of cannabinoids in the past year) [8]. High prevalence rates in studies are supported by wastewater analyses, which indicate that Swiss cities have among the highest wastewater concentrations of cocaine and 3,4-methylenedioxymethamphetamine (MDMA, “Ecstasy”) in Europe [16, 17].

Additionally, sex differences have been extensively documented. While males tend to exhibit higher rates of illegal substance use, females are more likely to engage in the non-medical use of prescription medications [1, 8]. But even though males use more substances overall, females progress more quickly from substance use to addiction, making them more vulnerable to substance use disorders [18].

Measuring Substance Use

Although studies on substance use prevalence rates to date have been crucial in advancing successful prevention and intervention strategies, they have limitations. First, policy and intervention planning require up-to-date information about dynamically changing substance use prevalence, which quickly renders older data obsolete. Second, studies have primarily relied on self-report questionnaires. These are essential for gathering information on substance use patterns, frequency, and intensity. Still, they rely on individuals’ willingness and ability to recall and report substance use behaviors accurately. Self-reports may be especially prone to recall bias due to fear of stigmatization or consequences associated with illegal substance use [19] but also possible intoxication effects [20], depending on the setting of the study.

Hair data can address these limitations as it offers a biological marker for substances and their metabolites, which are incorporated into the hair following consumption and can be detected in the hair roots and shafts as hair grows. Hair analysis can thus provide a retrospective timeline of substance use over recent months [21] and might be a more reliable and accurate method less susceptible to biases. Unlike urine or blood testing, hair samples can be gathered quickly and in most settings [22, 23]. While highly reliable for detecting cocaine, MDMA, and opioid use, the sensitivity of hair analysis is lower for cannabis, where only weekly to daily use can be reliably detected [24]. Additionally, tetrahydrocannabinol (THC) hair concentrations can also be influenced by external contamination [25, 26]. Several studies have investigated the concordance of self-reports and biological markers, including HAs. A recent systematic review found generally high agreement between self-reports and biological markers in clinical study settings. However, others have reported notable discrepancies between self-reports and hair analyses, with the use of certain substances underestimated. For example, at the community-representative level, anonymized [5], who lay the foundation for this study with repeated measures, reported underestimated use of illegal substances in self-reports compared to hair analyses among young adults in the z-proso cohort; MDMA, cocaine, ketamine, amphetamines, codeine, and opioid painkiller use was underreported by 30%–60%.

Underreporting of Substance Use

Self-reported behavioral and psychological characteristics may influence the accuracy of substance use self-reports [5]. For example, for codeine and/or cocaine use, higher levels of self-reported delinquency, attention-deficit hyperactivity disorder (ADHD) symptoms, internalizing symptoms, and lower levels of self-control are associated with less underreporting. One study using four repeated measures over 1 year reported findings of specific characteristics influencing underreporting (i.e., sex, age, ethnicity) but also reported overreporting of the substance used most often in recent months or that caused the most difficulties [27].

Current Study

Studies investigating the concordance of self-reports and biological markers have used cross-sectional designs; many focused on clinical or high-risk samples or relied on urine or blood tests. However, these approaches might not represent substance use in a more general population and do not account for intra-individual variability across time. Knowing about factors influencing reporting accuracy, such as memory recall, social desirability, or changes in usage patterns, can improve the accuracy of substance use assessments and inform the design of effective preventions and interventions. Therefore, this study uses data from two time points at ages 20 and 24 from a community-based sample to gain insights into substance use (12-month and 3-month prevalence rates) and their patterns’ temporal dynamics and assess the concordance of self-reports and hair analyses at these two time points.

We examined the following research questions (RQ) and hypotheses (H): RQ1.1: What are the 12-month self-report prevalence rates of cannabinoids (THC, cannabidiol, cannabis substitutes/synthetic cannabinoids), cocaine, amphetamine, substituted amphetamines (i.e., methamphetamine, MDMA), methylphenidate, hallucinogens (lysergic acid diethylamide (LSD)/psilocybin, 2C substances, ketamine), medical and non-medical opioids (codeine, opiate painkillers, heroin), dextromethorphan, Z-drugs, and tranquilizers (benzodiazepines) among 24-year-olds in the z-proso sample? H1.1: We expected to find high self-report prevalence rates (similar to age 20; [8]) among 24-year-olds, particularly for cannabis, cocaine, Ecstasy, and codeine [8, 28]. RQ1.2: How do prevalence rates of substance use among 24-year-olds differ between sexes? H1.2: We expected a higher rate of illegal substance use among men and a higher use of non-medical use of prescribed drugs among women [8, 28]. RQ2.1: How does the test-retest reliability of accuracy metrics (positivity and agreement rates, sensitivity, specificity, kappa values) between self-report substance use and hair analysis change over time, and how does this vary among substances? Since this is exploratory, we only assume the following hypothesis: H2.1: We expected consistent underreporting for most substances [4‒6]. RQ2.2: How does the test-retest reliability for the answer patterns (agreement, false negatives, false positives) between self-report substance use and hair analysis change over time, and how does this vary among substances? H2.2: Again, exploratory; we expected a certain consistency among the answer patterns. RQ3: What predicts (longitudinally) underreporting of substance use among the 24-year-olds? H3: We expected self-reported delinquency, ADHD, low self-control, and internalizing symptoms to be associated with less underreporting [5].

Recruitment and Participants

Data came from the Zurich Project on Social Development from Childhood to Adulthood (z-proso) [29]. In 2004, 56 out of 90 public elementary schools with 1,675 children were randomly selected in the city of Zurich. The selection process included stratification based on school size and districts’ socio-economic backgrounds, ensuring the sample closely mirrored the demographic structure of first-graders at that time in Zurich’s public schools. The first assessment included 1,360 pupils at age 7. Since then, eight additional assessment waves have been conducted, with the last two carried out in 2018 (wave 8; age 20; n = 1,180) and 2022 (wave 9; age 24; n = 1,160). In addition to completing the anonymous survey, participants were invited to provide 3 cm of hair to detect psychoactive substances stored in it for the previous 3 months at waves 8 and 9. If scalp hair was shorter than 1 cm, arm, leg, or chest hair was provided. A total of 1,002 hair samples were obtained at wave 8, 887 hair samples were obtained at wave 9, and 762 participants donated hair samples at both waves 8 and 9. However, one participant was excluded because of missing data on all relevant variables, resulting in an overlapping sample at waves 8 and 9 of 761 participants. The number of participants most likely decreased due to the online participation option that more participants took advantage of after COVID-19. The online participants did not donate a hair sample. Online supplementary Tables S1 and S2 (for all online suppl. material, see https://doi.org/10.1159/000541713) in the supplement show that both wave 8 and 9 samples (all vs. only both time points) are similar in their demographics.

Table 1 shows that the sample is evenly split in terms of sex-assigned-at-birth and consistent with Switzerland’s immigration policies. It is a multi-ethnic sample with a high percentage of participants having a migration background. The study was conducted in accordance with the Declaration of Helsinki. The Cantonal Ethics Committee Zurich (BASEC #2017-02021) and the Ethics Committee of the Faculty of Arts and Social Sciences, University of Zurich, approved the study. Participants provided written informed consent and received compensation for participation in the survey and the hair donation.

Table 1.

Sample characteristics and descriptive statistics of the main study variables wave 8 and wave 9

VariableWave 8Wave 9
nitemsaαb% (n)mean (SD)nitemsaαb% (n)mean (SD)
Sociodemographics 
Age, years 1,002    20.57 (0.38) 761    24.44 (0.38) 
Sex 1,002     761     
 Female    50.2 (503)     48.8 (371)  
 Male    49.8 (499)     51.2 (390)  
Parental socio-economic status (ISEI) 956    47.1 (19.8) 731    47.7 (20.0) 
Parental educational degree (highest in household) 804     620     
 University degree    30.6 (246)     30.6 (190)  
 Other    69.4 (558)     69.4 (430)  
Participant highest educational degree 1,002     760     
 Compulsory and preparatory vocational    24.0 (240)     7.6 (58)  
 Vocational    49.3 (494)     53.9 (410)  
 Academic    26.7 (268)     38.4 (292)  
Participants’ place of birthc 804     620     
 Participants born in Switzerland    90.4 (727)     91.3 (566)  
 Participants born abroad    9.6 (77)     8.7 (54)  
Parental migration background 984     746     
 Both parents born in Switzerland    25.0 (246)     28.2 (210)  
 One parent born abroad    27.6 (272)     28.0 (209)  
 Both parents born abroad    47.4 (466)     43.8 (327)  
Parental place of birth Switzerland 981     746     
   52.7 (517)     56.3 (420)  
 European Union and other European countries    27.8 (272)     26.0 (194)  
 Asia (including Turkey)    20.3 (199)     20.0 (152)  
 Former Yugoslavia    16.1 (158)     13.8 (103)  
 Latin America    6.9 (68)     7.6 (57)  
 Sub-Saharan Africa    4.2 (42)     3.4 (26)  
 Northern Africa    2.4 (24)     2.0 (15)  
 USA, Canada, New Zealand, Australia    2.1 (21)     2.3 (17)  
Psychological and behavioral correlates 
Physical aggression 1,002 0.86  1.20 (0.46) 760 0.84  1.10 (0.32) 
ADHD symptoms 1,002 0.79  2.72 (0.76) 760 0.80  2.85 (0.78) 
Internalizing symptoms 1,002 15 0.92  2.19 (0.75) 760 15 0.92  2.25 (0.75) 
Delinquency 1,002 24   2.56 (2.25) 760 24   2.26 (1.80) 
Low self-control 1,002 10 0.74  2.07 (0.42) 760 10 0.73  1.90 (0.40) 
Hair sample characteristics 
Hair type 1,002     761     
 Scalp    91.1 (913)     90.0 (686)  
 Other (arm, leg, chest)    8.9 (89)     10.0 (76)  
Weight of hair assessed, mg 1,002    12.76 (4.74) 761    14.47 (5.27) 
Hair treatment 1,002          
 Participants with hair bleaching    19.6 (196)     14.7 (112)  
 Participants without hair bleaching    80.4 (806)     85.3 (648)  
Hair color 1,001     761     
 Light    23.6 (236)     13.9 (106)  
 Brown    54.0 (541)     65.3 (497)  
 Dark    22.4 (224)     20.8 (158)  
VariableWave 8Wave 9
nitemsaαb% (n)mean (SD)nitemsaαb% (n)mean (SD)
Sociodemographics 
Age, years 1,002    20.57 (0.38) 761    24.44 (0.38) 
Sex 1,002     761     
 Female    50.2 (503)     48.8 (371)  
 Male    49.8 (499)     51.2 (390)  
Parental socio-economic status (ISEI) 956    47.1 (19.8) 731    47.7 (20.0) 
Parental educational degree (highest in household) 804     620     
 University degree    30.6 (246)     30.6 (190)  
 Other    69.4 (558)     69.4 (430)  
Participant highest educational degree 1,002     760     
 Compulsory and preparatory vocational    24.0 (240)     7.6 (58)  
 Vocational    49.3 (494)     53.9 (410)  
 Academic    26.7 (268)     38.4 (292)  
Participants’ place of birthc 804     620     
 Participants born in Switzerland    90.4 (727)     91.3 (566)  
 Participants born abroad    9.6 (77)     8.7 (54)  
Parental migration background 984     746     
 Both parents born in Switzerland    25.0 (246)     28.2 (210)  
 One parent born abroad    27.6 (272)     28.0 (209)  
 Both parents born abroad    47.4 (466)     43.8 (327)  
Parental place of birth Switzerland 981     746     
   52.7 (517)     56.3 (420)  
 European Union and other European countries    27.8 (272)     26.0 (194)  
 Asia (including Turkey)    20.3 (199)     20.0 (152)  
 Former Yugoslavia    16.1 (158)     13.8 (103)  
 Latin America    6.9 (68)     7.6 (57)  
 Sub-Saharan Africa    4.2 (42)     3.4 (26)  
 Northern Africa    2.4 (24)     2.0 (15)  
 USA, Canada, New Zealand, Australia    2.1 (21)     2.3 (17)  
Psychological and behavioral correlates 
Physical aggression 1,002 0.86  1.20 (0.46) 760 0.84  1.10 (0.32) 
ADHD symptoms 1,002 0.79  2.72 (0.76) 760 0.80  2.85 (0.78) 
Internalizing symptoms 1,002 15 0.92  2.19 (0.75) 760 15 0.92  2.25 (0.75) 
Delinquency 1,002 24   2.56 (2.25) 760 24   2.26 (1.80) 
Low self-control 1,002 10 0.74  2.07 (0.42) 760 10 0.73  1.90 (0.40) 
Hair sample characteristics 
Hair type 1,002     761     
 Scalp    91.1 (913)     90.0 (686)  
 Other (arm, leg, chest)    8.9 (89)     10.0 (76)  
Weight of hair assessed, mg 1,002    12.76 (4.74) 761    14.47 (5.27) 
Hair treatment 1,002          
 Participants with hair bleaching    19.6 (196)     14.7 (112)  
 Participants without hair bleaching    80.4 (806)     85.3 (648)  
Hair color 1,001     761     
 Light    23.6 (236)     13.9 (106)  
 Brown    54.0 (541)     65.3 (497)  
 Dark    22.4 (224)     20.8 (158)  

ADHD, attention-deficit/hyperactivity disorder; ISEI, International Socio-Economic Index of Occupational Status.

aNumber of items used to compute multi-item scales.

bCronbach’s alpha for multi-item scales based on study sample.

cNumbers total more than 100% because information on both parents is included.

Measures

Self-Reported Substance Use

An extensive list of substances was given to the participants, including cannabinoids, stimulants, hallucinogens, and non-medical and medical opioids (complete list in online suppl. Table S3). Participants indicated how often they had consumed each substance in their life as well as in the past year (1 = never; 2 = once, 3 = 2–5 times [quarterly]; 4 = 6–12 times [monthly]; 5 = 13–52 [weekly] 6 = 53–365 times [daily]), and within the past 3 months (0 = never; 1 = once; 2 = 2–5 times [quarterly], 3 = weekly, 4 = [almost] daily). Regarding prescription drugs, participants were asked to report only non-medical use (i.e., more frequently or at higher doses than prescribed).

Hair Toxicological Analyses

The substances and their metabolites were quantified using liquid chromatography-tandem mass spectrometry, which has been described using different data [30, 31] and in the supplement of the wave 8 cross-sectional investigation [5], on which this study is based. Participants completed an additional questionnaire on over-the-counter medications and variables that could affect hair toxicology analyses (e.g., hair bleaching [0 = no, 1 = yes], hair color [light, brown, dark]).

Psychological and Behavioral Correlates of Underreporting, Self-Reported at Age 20 and 24

The following subscales from the Social Behavior Questionnaire (SBQ) [32, 33] were included: physical aggression, ADHD, and internalizing symptoms. On a 5-point Likert scale, ranging from 1 = never to 5 = very often, participants reported how frequently they had experienced or engaged in these behaviors within the last year for physical aggression and ADHD symptoms and within the previous month for internalizing symptoms. Delinquency was assessed using a 24-item dichotomous checklist [34, 35], including minor and major delinquent acts, such as graffitiing and assault. For this scale, a sum score was computed. Finally, self-control was measured using a 10-item adaptation of the Grasmick Self-Control Scale [35, 36]. On a 4-point scale ranging from 1 = fully untrue to 4 = fully true, participants reported on items such as Sometimes I do dangerous things just for the fun of it.

Sociodemographic Variables

Sex-assigned-at-birth and parental migration background were measured as binary variables (0 = female, 1 = male; 0 = at least one parent born in Switzerland; 1 = both parents born abroad). Information was gathered in W4 to W6 for the migration background variable. International Socio-Economic Index of Occupational Status scores were derived from parental occupation and ranged from 16 (e.g., unskilled worker) to 90 (e.g., judge) [37]. The regression models used participants’ highest educational degree as a dummy variable, including compulsory schooling and preparatory vocational education, academic education, and vocational degrees (as reference category).

Statistical Analysis

All analyses were conducted in SPSS 29 [38] and R [39]. For data visualization, ggplot2 [40] was used. Prevalence estimates were computed separately for females and males for the entire sample. Group differences were tested using a χ2-test. Dummy variables were created to compare the reliability of self-reported substance use and its detection in hair analyses to assess the reliability of the two methods. These variables were coded as 0 for no self-reported use and no substance detection in hair and 1 for self-reported use and substance detection in hair, based on the concentration of the substance or its metabolites exceeding the limit of quantification. This coding was applied independently to both self-report data and hair analysis results. However, due to a programming error in the filter structure of the questionnaire, the 3-month self-reports for MDMA were not assessed at wave 8 and could thus only be reported at wave 9. In the cross-sectional investigation [5], a “by proxy” prevalence estimation of MDMA was calculated by using self-reported ratios from 3- and 12-month prevalence rates of cocaine and amphetamine use, which often occur in similar contexts such as parties.

McNemar tests were used to investigate reliability and compare self-report and hair analysis prevalence estimates based on the dummy variables. We calculated the following diagnostic accuracy metrics: positivity rate, under-/overreporting rate, specificity, sensitivity, balanced accuracy, detection ratio, agreement, and kappa for the reported substances at both time points [21] (Table 2). An explanation of these statistics can be found in wave 8 [5] (online suppl. Table S4). The metrics for the time points were compared using z-scores, except for the detection ratio, which cannot be used for proportion tests. For the accuracy metrics, we used (1) the entire sample participating at both time points (n = 761) and (2) an optimal sample of participants who provided ≥3 cm of scalp hair that weighed ≥5 mg (n = 613) (online suppl. Tables S5–S7). Similarly, χ2 tests were conducted to analyze the test-retest reliability of the self-reports, hair analyses, agreement (identical coding for both methods, 0/0 and 1/1 vs. 0/1 and 1/0), false positives (negative hair analysis but positive self-report), and false negatives (positive hair analysis but negative self-report). Online supplementary Table S8 provides an overview of the test-retest variables and answer patterns.

Table 2.

Comparison of self-report (SR) and hair analysis (HA): substance use during the previous three months for wave 8 [5] and wave 9 with the full sample

WaveSubstanceSample sizeaPositive HA, % (n)Positive SR, % (n)Under-/over-reportingDetection ratioAgreementKappaHA specificityHA sensitivityHA balanced accuracySR specificitySR sensitivitySR balanced accuracy
W8 Cannabis weekly/dailyc 1,001 14.2 (142) 18.6 (186) 31.0 (o) 0.76 90.4 0.65 96.8 62.4 79.6 91.9 81.7 86.8 
W9 Cannabis weekly/dailyc 759 14.0 (106) 16.5 (125) 17.9 (o) 0.85 90.9 0.65 96.1 64.8 80.5 93.3 76.4 84.9 
z-test   −0.12 −1.14 −6.24*  0.36 −0.13 −0.79 1.04 0.44 1.10 −2.72** −1.16 
W8 Cannabis dailyc 1,001 14.2 (142) 11.1 (111) 21.8 (u) 1.28 91.7 0.63 93.6 76.6 85.1 97.0 59.9 78.5 
W9 Cannabis dailyc 759 14.0 (106) 10.8 (82) 22.6 (u) 1.29 91.3 0.60 93.4 74.4 83.9 96.8 57.5 77.2 
z-test   −0.12 −0.20 0.41  −0.30 −1.07 −0.17 −1.07 −0.69 −0.24 −1.01 −0.65 
W8 Amphetamines 1,002 1.9 (19) 3.7 (37) 94.7 (o) 0.51 96.6 0.38 99.2 29.7 64.5 97.4 57.9 77.7 
W9 Amphetamines 760 2.4 (18) 2.6 (20) 11.1 (o) 0.90 97.4 0.46 98.8 45.0 71.9 98.5 50.0 74.3 
z-test   0.72 −1.29 −35.80*  0.97 3.50* −0.85 6.61* 3.31* 1.59 −3.30** −1.66 
W8 MDMA/Ecstasy 1,002 12.2 (122) 7.2d 41.1 (u)d 1.70d —  — — — — — — 
W9 MDMA/Ecstasy 761 12.1 (92) 5.8 (44) 52.2 (u) 2.09 92.4 0.54 92.6 88.6 90.6 99.3 42.4 70.9 
z-test   −0.06 −1.17 4.63*  — — — — — — — — 
W8 Cocaine 1,001 9.4 (94) 6.6 (66) 29.8 (u) 1.42 93.6 0.57 95.1 72.7 83.9 98.0 51.1 74.6 
W9 Cocaine 760 22.8 (173) 9.2 (70) 59.5 (u) 2.47 84.3 0.44 83.9 88.6 86.3 98.6 35.8 67.2 
z-test   7.76* 2.02*** 12.5*  −6.33* −5.41* −7.85* 8.20* 1.37 0.95 −6.40* −3.38* 
W8 Ketamine 1,002 2.3 (23) 0.9 (9) 60.9 (u) 2.56 98.2 0.43 98.4 77.8 88.1 99.8 30.4 65.1 
W9 Ketamine 760 6.2 (47) 1.7 (13) 72.3 (u) 3.62 95.3 0.38 95.3 92.3 93.8 99.9 25.5 84.9 
z-test   4.15* 1.5 5.03*  −3.51* −1.99*** −3.82* 8.23* 2.42* −6.46* −2.08* 8.42* 
W8 Codeineb 1,002 11.3 (113) 5.9 (59) 47.8 (u) 1.92 88.6 0.28 91.1 49.2 70.2 96.6 25.7 61.2 
W9 Codeineb 760 7.5 (57) 3.7 (28) 50.9 (u) 2.04 92.5 0.30 94.1 50.0 72.1 98.0 24.6 61.3 
z-test   −2.67** −2.11*** 1.29  2.74** 0.60 2.35*** 0.33 0.87 1.77 −0.53 0.06 
W8 Opioid painkillersb 1,001 2.6 (26) 2.9 (29) 11.5 (o) 0.90 95.9 0.23 98.0 24.1 61.1 97.7 26.9 62.3 
W9 Opioid painkillersb 759 3.8 (29) 6.3 (48) 65.5 (o) 0.60 92.2 0.20 97.2 18.8 58.0 94.7 31.0 62.9 
z-test   1.43 0.36 23.54*  −3.32* −1.97*** −1.10 −2.67** −1.29 −3.35* 1.88 0.24 
WaveSubstanceSample sizeaPositive HA, % (n)Positive SR, % (n)Under-/over-reportingDetection ratioAgreementKappaHA specificityHA sensitivityHA balanced accuracySR specificitySR sensitivitySR balanced accuracy
W8 Cannabis weekly/dailyc 1,001 14.2 (142) 18.6 (186) 31.0 (o) 0.76 90.4 0.65 96.8 62.4 79.6 91.9 81.7 86.8 
W9 Cannabis weekly/dailyc 759 14.0 (106) 16.5 (125) 17.9 (o) 0.85 90.9 0.65 96.1 64.8 80.5 93.3 76.4 84.9 
z-test   −0.12 −1.14 −6.24*  0.36 −0.13 −0.79 1.04 0.44 1.10 −2.72** −1.16 
W8 Cannabis dailyc 1,001 14.2 (142) 11.1 (111) 21.8 (u) 1.28 91.7 0.63 93.6 76.6 85.1 97.0 59.9 78.5 
W9 Cannabis dailyc 759 14.0 (106) 10.8 (82) 22.6 (u) 1.29 91.3 0.60 93.4 74.4 83.9 96.8 57.5 77.2 
z-test   −0.12 −0.20 0.41  −0.30 −1.07 −0.17 −1.07 −0.69 −0.24 −1.01 −0.65 
W8 Amphetamines 1,002 1.9 (19) 3.7 (37) 94.7 (o) 0.51 96.6 0.38 99.2 29.7 64.5 97.4 57.9 77.7 
W9 Amphetamines 760 2.4 (18) 2.6 (20) 11.1 (o) 0.90 97.4 0.46 98.8 45.0 71.9 98.5 50.0 74.3 
z-test   0.72 −1.29 −35.80*  0.97 3.50* −0.85 6.61* 3.31* 1.59 −3.30** −1.66 
W8 MDMA/Ecstasy 1,002 12.2 (122) 7.2d 41.1 (u)d 1.70d —  — — — — — — 
W9 MDMA/Ecstasy 761 12.1 (92) 5.8 (44) 52.2 (u) 2.09 92.4 0.54 92.6 88.6 90.6 99.3 42.4 70.9 
z-test   −0.06 −1.17 4.63*  — — — — — — — — 
W8 Cocaine 1,001 9.4 (94) 6.6 (66) 29.8 (u) 1.42 93.6 0.57 95.1 72.7 83.9 98.0 51.1 74.6 
W9 Cocaine 760 22.8 (173) 9.2 (70) 59.5 (u) 2.47 84.3 0.44 83.9 88.6 86.3 98.6 35.8 67.2 
z-test   7.76* 2.02*** 12.5*  −6.33* −5.41* −7.85* 8.20* 1.37 0.95 −6.40* −3.38* 
W8 Ketamine 1,002 2.3 (23) 0.9 (9) 60.9 (u) 2.56 98.2 0.43 98.4 77.8 88.1 99.8 30.4 65.1 
W9 Ketamine 760 6.2 (47) 1.7 (13) 72.3 (u) 3.62 95.3 0.38 95.3 92.3 93.8 99.9 25.5 84.9 
z-test   4.15* 1.5 5.03*  −3.51* −1.99*** −3.82* 8.23* 2.42* −6.46* −2.08* 8.42* 
W8 Codeineb 1,002 11.3 (113) 5.9 (59) 47.8 (u) 1.92 88.6 0.28 91.1 49.2 70.2 96.6 25.7 61.2 
W9 Codeineb 760 7.5 (57) 3.7 (28) 50.9 (u) 2.04 92.5 0.30 94.1 50.0 72.1 98.0 24.6 61.3 
z-test   −2.67** −2.11*** 1.29  2.74** 0.60 2.35*** 0.33 0.87 1.77 −0.53 0.06 
W8 Opioid painkillersb 1,001 2.6 (26) 2.9 (29) 11.5 (o) 0.90 95.9 0.23 98.0 24.1 61.1 97.7 26.9 62.3 
W9 Opioid painkillersb 759 3.8 (29) 6.3 (48) 65.5 (o) 0.60 92.2 0.20 97.2 18.8 58.0 94.7 31.0 62.9 
z-test   1.43 0.36 23.54*  −3.32* −1.97*** −1.10 −2.67** −1.29 −3.35* 1.88 0.24 

MDMA, 3,4-methylenedioxymethamphetamine.

Bold values indicate significant z-scores.

aIncludes for wave 9 only cases with valid data from both hair samples and self-reports at both time points.

bCorrected for self-reported medical use.

cHair toxicology analysis can typically detect only regular or intense exposure to cannabis. Therefore, we provide comparisons of hair tests with frequent self-reported cannabis use using the categories of “weekly to daily use” vs. “less/no use” and of “daily use” vs. “less/no use.”

dFor MDMA/Ecstasy, 3-month self-reports were not available, and we estimated the self-report prevalence (see Methods for details); data based on estimated prevalence are given in italics.

*p < 0.001.

**p < 0.01.

***p < 0.05.

To assess the (longitudinal) influence of response biases on underreporting, binary logistic regressions were conducted, focusing on behavioral and psychological predictors of underreporting of cocaine and codeine. These two substances were considered representations of illegal and medical substances with notable baseline prevalence. The regression models were adjusted for the concentrations of the substances and their metabolites found in hair, allowing for unintentional substance exposure of exposure from over 3 months prior.

Prevalence Rates and Sex Differences

The 12-month self-reported prevalence rates (Fig. 1, online suppl. Table S9) of substance use at age 24 in the total sample (nWave9 = 987) are high. Cannabis is the most-used substance (47.4%), followed by cannabidiol products (24.9%). Substances with rates above 10% are codeine, MDMA, cocaine, opioid painkillers, and LSD/psilocybin (only in males). LSD/psilocybin, medical stimulants, benzodiazepines, amphetamines, z-drugs (only in females), and cannabis substitutes (only in males) have been non-medically used by more than 5% in the past year.

Fig. 1.

Past-year prevalence and frequency of substance use at age 20 (wave 8; from [6]) and age 24 (wave 9) in the entire sample for each time point, overall (a) and sex-specific (b). DXM, dextromethorphan.

Fig. 1.

Past-year prevalence and frequency of substance use at age 20 (wave 8; from [6]) and age 24 (wave 9) in the entire sample for each time point, overall (a) and sex-specific (b). DXM, dextromethorphan.

Close modal

In general, more males reported illegal substance use compared to females. In contrast, females tended to report more non-medical use of prescribed medications than males.

The appendix contains 3-month prevalence rates of hair analysis, which confirm highly prevalent substance use (n = 761). It also includes past-year prevalence rates for the self-report/hair analysis sample and statistical testing of sex differences (online suppl. Fig. S1, S2; Tables S10, S11).

Changes in Substance Use

Changes in 3-month prevalence rates between ages 20 and 24 have been tested for the use of cannabis, amphetamines, MDMA/Ecstasy, cocaine, ketamine, codeine, and opioid painkillers (Table 2). Self-report-derived prevalence rates significantly increased for cocaine and decreased for codeine. Hair analysis-derived prevalence rates of cocaine and ketamine significantly increased over time, while it was decreasing for codeine.

Concordance of Self-Reports and Hair Analyses

Test-Retest Reliability for Accuracy Metrics

Comparing 3-month prevalence rates of self-report and hair analysis over time (Table 2), consistent underreporting (similar underreporting rates between time points, measured through z-tests, and investigation of false negatives through χ2 tests) was observed for daily cannabis, Ecstasy/MDMA, cocaine, ketamine, and codeine use. Underreporting of Ecstasy/MDMA, cocaine, ketamine, and opioid painkillers significantly increased from age 20 to 24. Contrarily, weekly to daily cannabis, amphetamine, and opioid painkiller use was consistently overreported. The detection ratio increased for all substances from wave 8 to wave 9, except for opioid painkillers. The agreement rate decreased for cocaine, ketamine, and opioid painkillers but increased for codeine. Kappa values also decreased for amphetamine, cocaine, ketamine, and opioid painkillers. Like in wave 8, most hair analyses demonstrated higher sensitivity but lower specificity than self-reports, suggesting that hair analysis more often confirmed positive results from self-reports than vice versa. However, significant differences in kappa values were found again for amphetamines, cocaine, ketamine, and opioid painkillers. Except for opioid painkillers, hair sensitivity increased significantly at wave 9 for all substances. Self-report specificity did not change for most substances across time, except for a decrease in ketamine and opioid painkiller use. Furthermore, self-report sensitivity was similar to hair analysis sensitivity, which, in most cases, was lower than the specificity. It also decreased for frequent cannabis, amphetamines, and cocaine use but increased for ketamine use. Finally, balanced accuracy was higher for most substances in the hair analyses than the self-reports. Similar results were found when running the analyses with the optimal sample only. The most substantial difference was found for self-report sensitivity for ketamine, which was lower in the optimal sample (online suppl. Table S5).

Test-Retest Reliability for Answer Patterns

Each method’s test-retest reliability revealed statistically significant kappa values for all substances, indicating specific measurement stability across time for each method separately (Table 3). Participants had similar answer patterns across time (Table 4). When investigating the agreement rates (self-report and hair analysis were either positive or negative) across time, a high concordance rate for all substances was found. Self-reports and hair analysis aligned at waves 8 and 9 for most participants, reinforcing the credibility of both methods for capturing substance use. However, several participants moved from the no-agreement group to the agreement group and vice versa between the time points, meaning initial agreement was not always maintained. The highest rates of no agreement at both time points were found for cannabis and codeine.

Table 3.

Stability of self-reports and hair analyses for each method separately across time

SubstanceSample sizeNo use at W8 and W9, % (n)Use at W8 and W9, % (n)No use at W8, use at W9, % (n)Use at W8, no use at W9, % (n)χ2PhiKappa
Self-report 
Cannabis (weekly) 760 93.3 (583) 62.2 (84) 6.7 (42) 37.8 (51) 247.280* 0.570* 0.570* 
Cannabis (daily) 760 95.5 (644) 61.6 (53) 4.5 (30) 38.4 (33) 256.289* 0.581* 0.581* 
Amphetamine 760 98.4 (720) 28.6 (8) 1.6 (12) 71.4 (20) 76.342* 0.317* 0.312* 
Cocaine 759 93.9 (663) 49.1 (26) 6.1 (43) 50.9 (27) 110.122* 0.381* 0.377* 
Ketamine 760 98.8 (742) 44.4 (4) 1.2 (9) 55.6 (5) 98.929* 0.361* 0.355* 
Codeine 760 97.1 (691) 14.6 (7) 2.9 (21) 85.4 (41) 17.152* 0.150* 0.144* 
Opioid painkillers 758 94.2 (695) 20.0 (4) 5.8 (43) 80.0 (16) 6.726** 0.094** 0.086** 
Hair analysis 
Cannabis 759 94.6 (614) 64.5 (71) 5.4 (35) 35.5 (39) 273.909* 0.601* 0.601* 
Amphetamine 761 98.1 (731) 25.0 (4) 1.9 (14) 75.0 (12) 36.258* 0.218* 0.218* 
MDMA/Ecstasy 761 92.4 (622) 46.6 (41) 7.6 (51) 53.4 (47) 111.451* 0.383* 0.383* 
Cocaine 761 83.1 (571) 78.4 (58) 16.9 (116) 21.6 (16) 143.233* 0.434* 0.384* 
Ketamine 761 94.9 (706) 58.8 (10) 5.1 (38) 41.2 (7) 81.149* 0.327* 0.284* 
Codeine 761 93.6 (630) 15.9 (14) 6.4 (43) 84.1 (74) 10.179* 0.116* 0.112* 
Opioid painkillers 761 96.6 (717) 21.2 (4) 3.4 (25) 78.9 (15) 15.804*** 0.144* 0.141* 
SubstanceSample sizeNo use at W8 and W9, % (n)Use at W8 and W9, % (n)No use at W8, use at W9, % (n)Use at W8, no use at W9, % (n)χ2PhiKappa
Self-report 
Cannabis (weekly) 760 93.3 (583) 62.2 (84) 6.7 (42) 37.8 (51) 247.280* 0.570* 0.570* 
Cannabis (daily) 760 95.5 (644) 61.6 (53) 4.5 (30) 38.4 (33) 256.289* 0.581* 0.581* 
Amphetamine 760 98.4 (720) 28.6 (8) 1.6 (12) 71.4 (20) 76.342* 0.317* 0.312* 
Cocaine 759 93.9 (663) 49.1 (26) 6.1 (43) 50.9 (27) 110.122* 0.381* 0.377* 
Ketamine 760 98.8 (742) 44.4 (4) 1.2 (9) 55.6 (5) 98.929* 0.361* 0.355* 
Codeine 760 97.1 (691) 14.6 (7) 2.9 (21) 85.4 (41) 17.152* 0.150* 0.144* 
Opioid painkillers 758 94.2 (695) 20.0 (4) 5.8 (43) 80.0 (16) 6.726** 0.094** 0.086** 
Hair analysis 
Cannabis 759 94.6 (614) 64.5 (71) 5.4 (35) 35.5 (39) 273.909* 0.601* 0.601* 
Amphetamine 761 98.1 (731) 25.0 (4) 1.9 (14) 75.0 (12) 36.258* 0.218* 0.218* 
MDMA/Ecstasy 761 92.4 (622) 46.6 (41) 7.6 (51) 53.4 (47) 111.451* 0.383* 0.383* 
Cocaine 761 83.1 (571) 78.4 (58) 16.9 (116) 21.6 (16) 143.233* 0.434* 0.384* 
Ketamine 761 94.9 (706) 58.8 (10) 5.1 (38) 41.2 (7) 81.149* 0.327* 0.284* 
Codeine 761 93.6 (630) 15.9 (14) 6.4 (43) 84.1 (74) 10.179* 0.116* 0.112* 
Opioid painkillers 761 96.6 (717) 21.2 (4) 3.4 (25) 78.9 (15) 15.804*** 0.144* 0.141* 

W8, wave 8, W9, wave 9; for each substance, the percentages for “no use at W8 and W9” and “no use at W8, use at W9” sum to 100%, and the percentages for “use at W8 and W9” and “use at W8, no use at W9” also sum to 100%.

*p < 0.001.

**p < 0.05.

***p < 0.01.

Table 4.

Stability of agreement (self-reports and hair analyses) across time

SubstanceSample sizeNo agreement at W8 and W9, % (n)Agreement at W8 and W9, % (n)No agreement at W8, agreement at W9, % (n)Agreement at W8, no agreement at W9, % (n)χ2PhiKappa
Agreement 
Cannabis 758 26.5 (18) 93.3 (644) 6.7 (46) 73.5 (68) 31.404* 0.204* 0.203* 
Amphetamine 760 15.0 (3) 96.9 (717) 3.1 (23) 85.0 (17) 8.335** 0.105** 0.104** 
Cocaine 759 17.6 (21) 95.6 (612) 4.4 (28) 82.4 (98) 29.268* 0.196* 0.175* 
Ketamine 760 8.3 (3) 98.8 (715) 1.2 (9) 91.7 (33) 11.094* 0.121* 0.104* 
Codeine 758 29.8 (17) 89.9 (632) 10.1 (71) 70.2 (40) 20.037* 0.162* 0.158* 
Opioid painkillers 758 8.6 (5) 96.7 (677) 3.3 (23) 91.4 (53) 4.285*** 0.075*** 0.070*** 
SubstanceSample sizeNo agreement at W8 and W9, % (n)Agreement at W8 and W9, % (n)No agreement at W8, agreement at W9, % (n)Agreement at W8, no agreement at W9, % (n)χ2PhiKappa
Agreement 
Cannabis 758 26.5 (18) 93.3 (644) 6.7 (46) 73.5 (68) 31.404* 0.204* 0.203* 
Amphetamine 760 15.0 (3) 96.9 (717) 3.1 (23) 85.0 (17) 8.335** 0.105** 0.104** 
Cocaine 759 17.6 (21) 95.6 (612) 4.4 (28) 82.4 (98) 29.268* 0.196* 0.175* 
Ketamine 760 8.3 (3) 98.8 (715) 1.2 (9) 91.7 (33) 11.094* 0.121* 0.104* 
Codeine 758 29.8 (17) 89.9 (632) 10.1 (71) 70.2 (40) 20.037* 0.162* 0.158* 
Opioid painkillers 758 8.6 (5) 96.7 (677) 3.3 (23) 91.4 (53) 4.285*** 0.075*** 0.070*** 

Agreement: W8 = wave 8, W9 = wave 9; self-reports and hair analyses both positive or both negative. For each substance, “agreement at W8 and W9” and “no agreement at W8, agreement at W9” sum to 100%, and the percentages for “no agreement at W8 and W9” and “agreement at W8, no agreement at W9” also sum to 100%.

*p < 0.001.

**p < 0.01.

***p < 0.05.

False negatives (positive hair analysis but negative self-report) were consistent across time for most substances, except amphetamine and opioid painkillers; false positives (negative hair analysis but positive self-report) were consistent across time for cannabis and amphetamines (Table 5). The significant relationship between false negatives for cannabis, cocaine, ketamine, and codeine at both time points indicates that the discrepancy between self-reports and hair analysis does not occur by chance. A certain consistency in incorrect self-reporting can be found. The same applies to false positives for cannabis and amphetamines.

Table 5.

Stability of false negatives and false positives (self-reports and hair analyses) across time

SubstanceSample sizeNo false negatives at W8 and W9, % (n)False negative at W8 and W9, % (n)No false negative at W8, false negative at W9, % (n)False negative at W8, no false negative at W9, % (n)χ2PhiKappa
False negatives 
Cannabis 758 97.8 (718) 16.7 (4) 83.3 (20) 2.2 (16) 18.986*** 0.158*** 0.158*** 
Amphetamine 760 99.1 (744) 0.0 (0) 100.0 (9) 0.9 (7) 0.85 −0.011 −0.010 
Cocaine 759 97.2 (630) 14.4 (16) 85.6 (95) 2.8 (18) 29.990*** 0.199*** 0.163*** 
Ketamine 760 99.0 (718) 8.6 (3) 91.4 (32) 1.0 (7) 14.875*** 0.140*** 0.115*** 
Codeine 760 92.7 (665) 27.9 (12) 72.1 (31) 7.3 (52) 22.441*** 0.172*** 0.168*** 
Opioid painkillers 758 98.4 (726) 5.0 (1) 95.0 (19) 1.6 (12) 1.315 0.042 0.041 
SubstanceSample sizeNo false negatives at W8 and W9, % (n)False negative at W8 and W9, % (n)No false negative at W8, false negative at W9, % (n)False negative at W8, no false negative at W9, % (n)χ2PhiKappa
False negatives 
Cannabis 758 97.8 (718) 16.7 (4) 83.3 (20) 2.2 (16) 18.986*** 0.158*** 0.158*** 
Amphetamine 760 99.1 (744) 0.0 (0) 100.0 (9) 0.9 (7) 0.85 −0.011 −0.010 
Cocaine 759 97.2 (630) 14.4 (16) 85.6 (95) 2.8 (18) 29.990*** 0.199*** 0.163*** 
Ketamine 760 99.0 (718) 8.6 (3) 91.4 (32) 1.0 (7) 14.875*** 0.140*** 0.115*** 
Codeine 760 92.7 (665) 27.9 (12) 72.1 (31) 7.3 (52) 22.441*** 0.172*** 0.168*** 
Opioid painkillers 758 98.4 (726) 5.0 (1) 95.0 (19) 1.6 (12) 1.315 0.042 0.041 
Sample sizeNo false positives at W8 and W9, % (n)False positive at W8 and W9, % (n)No false positive at W8, false positive at W9, % (n)False positive at W8, no false positive at W9, % (n)χ2PhiKappa
False positives 
Cannabis 758 95.5 (682) 27.3 (12) 72.7 (32) 4.5 (32) 39.373*** 0.228*** 0.228*** 
Amphetamine 760 97.7 (732) 18.2 (2) 81.8 (9) 2.3 (17) 11.261*** 0.122*** 0.117*** 
Cocaine 759 98.0 (736) 0.0 (0) 100.0 (8) 2.0 (15) 0.163 −0.015 −0.014 
Ketamine 760 99.7 (757) 0.0 (0) 100.0 (1) 0.3 (2) 0.003 −0.002 −0.002 
Codeine 760 96.9 (723) 7.1 (1) 92.9 (13) 3.1 (23) 0.741 0.031 0.030 
Opioid painkillers 758 98.2 (707) 5.3 (2) 94.7 (36) 1.8 (13) 2.225 0.054 0.048 
Sample sizeNo false positives at W8 and W9, % (n)False positive at W8 and W9, % (n)No false positive at W8, false positive at W9, % (n)False positive at W8, no false positive at W9, % (n)χ2PhiKappa
False positives 
Cannabis 758 95.5 (682) 27.3 (12) 72.7 (32) 4.5 (32) 39.373*** 0.228*** 0.228*** 
Amphetamine 760 97.7 (732) 18.2 (2) 81.8 (9) 2.3 (17) 11.261*** 0.122*** 0.117*** 
Cocaine 759 98.0 (736) 0.0 (0) 100.0 (8) 2.0 (15) 0.163 −0.015 −0.014 
Ketamine 760 99.7 (757) 0.0 (0) 100.0 (1) 0.3 (2) 0.003 −0.002 −0.002 
Codeine 760 96.9 (723) 7.1 (1) 92.9 (13) 3.1 (23) 0.741 0.031 0.030 
Opioid painkillers 758 98.2 (707) 5.3 (2) 94.7 (36) 1.8 (13) 2.225 0.054 0.048 

W8 = wave 8, W9 = wave 9; false negatives: hair analysis is positive, but self-report is negative; false positives: hair analysis is negative, but self-report is positive. For each substance, “no false negative/positive at W8 and W9” and “false negative/positive at W8, no false negative/positive at W9” sum to 100%, and the percentages for “false negative/positive at W8 and W9” and “no false negative/positive at W8, false negative/positive at W9” also sum to 100%.

***p < 0.001.

Correlates of Underreporting

Consistent with findings from wave 8 [5], regression analyses revealed that sociodemographic variables were not associated with the underreporting of cocaine and codeine use in wave 9 (online suppl. Tables S12, S13). Still, cross-sectionally higher delinquency and ADHD symptoms were associated with lower odds of underreporting cocaine use. Higher levels of internalizing symptoms were found to decrease the odds of underreporting codeine use. Further analysis, including only individuals who did not report medical use of codeine, was not possible due to the reduced sample size. Longitudinally, only higher levels of internalizing symptoms were associated with lower odds of underreporting codeine (online suppl. Table S13).

This study examined the 12-month prevalence of self-report, the sex differences in prevalence rates, the concordance between 3-month prevalence rates of self-report and hair analysis, and predictors of underreporting using two waves of a community-representative sample of substance use among young adults in Switzerland aged 20 at wave 8 and 24 at wave 9.

Prevalence Rates

In line with our H1.1, we found higher prevalence rates across the last 12 and last 3 months for the 24-year-olds compared to 4 years earlier for many substances, with a particular rise in cocaine and opioid painkiller use. Additionally, in line with H1.2, males reported higher illegal substance use, whereas females reported higher non-medical prescription medication use. Hair analysis corroborated these results, which align with previous findings [1].

We found strikingly high prevalence rates for cocaine use (3 months: 9.4% in self-report and 22.8% in hair analysis), which increased more than 2-fold in hair analysis between ages 20 and 24. Also, ketamine use showed a 3-fold increase in hair. These numbers support previous findings indicating either a later age of onset for cocaine and ketamine use compared to substances such as cannabis or MDMA [9] or a general increase in use in the population. Nevertheless, the sample is Zurich-based, implying cocaine, ketamine, MDMA, methamphetamine, and cannabis use is high in the city, as confirmed by wastewater analysis [41].

Concordance of Self-Reports and Hair Analyses

Across time, underreporting is highly consistent but also significantly increased for MDMA (caution: underreporting was only estimated for age 20), cocaine, and ketamine (Table 2). Also, overreporting was consistent, which decreased for weekly to daily cannabis use and amphetamines but increased for opioid painkillers. The increase in underreporting of cocaine use might be explained by the stigma of the substance, while ketamine might be ingested unintentionally in the context of cocaine or MDMA use. The changes in overreporting of weekly to daily cannabis use and amphetamines might be explained by measurement problems of hair analysis specifically for these two substance groups [5].

The observed increase in the detection ratio between waves 8 and 9 for most substances indicates a shift in substance use responses as the detection methods did not change over the studied period, supporting H2.1. Declining agreement rates for several substances further imply a divergence in self-reported substance use compared to hair analysis. Only codeine stands out with an increased agreement rate and a decrease in use, which might reflect stronger controlled codeine access. As of January 1, 2019, access to codeine and codeine- or dihydrocodeine-containing drugs in Swiss pharmacies has been more restricted as the acquisition must be discussed with and documented by a pharmacist [42]. Therefore, participants using codeine in 2022 may have had a better understanding of the substance they used than in 2018.

The decline in kappa values from wave 8 to wave 9 for most substances underscores the growing discrepancy between self-reported use and hair analysis. This trend is particularly noticeable with substances such as amphetamines and cocaine. Both substances are linked to social stigma and can thus affect the accuracy of self-reports [19, 43]. The substantial decrease in self-report sensitivity combined with increased hair analysis sensitivity might further support participants’ reluctance to disclose specific substance use. However, the consistent specificity of self-report (except for ketamine and opioid painkillers) demonstrates that even though numerous participants underreport substance use, when participants do report use, it tends to be reliable.

Test-Retest Reliability for Answer Patterns

Both methods demonstrate notable measurement stability, supporting H2.1. The consistency in participants’ response patterns reinforces this conclusion. However, that some participants changed their answers over time and shifted between agreement and disagreement groups suggests that initial agreement cannot absolutely predict future concordance or vice versa.

Moreover, stability across false negative and false positive patterns underscores the self-report and hair analysis data discrepancies, confirming H2.2. The consistent patterns of false negatives for codeine, cannabis, ketamine, and cocaine indicate that the use of hair analysis is highly relevant. Furthermore, the consistent patterns of false positives for amphetamines and cannabis might point to limitations in hair analysis. THC, in particular, is more difficult to detect than other illegal substances [24]. These patterns of false negatives and positives suggest a systematic bias rather than a random error in reporting, potentially driven by external factors like stigma and recall bias. However, it is also important to consider that the underreporting of substance use may also be influenced by the adulteration of specific street substances with another undeclared psychotropic agent [44], which users cannot recognize or report accurately. For example, ketamine is known to be an adulterant of Ecstasy pills or an admixture of cocaine (called “CK”) [45‒47]. This adds another layer of complexity, emphasizing the need for methods that can detect or inquire about adulterants to ensure more reliable substance use assessments.

It is worth noting that the accuracy of substance use reporting might differ between population-based studies and substance-specific studies. Broader samples may yield less specific self-reports compared to focused studies on particular substances or populations. These specialized studies often employ more elaborate measures, including face-to-face interviews, possibly improving self-report validity [4]. Indeed, clinical case-control studies have shown high correlations between self-reported and hair-tested cocaine use measures [48].

Therefore, predicting response patterns with certainty is not feasible, and these findings highlight the necessity of using multiple methods to capture substance use accurately. After all, even though longitudinal investigations allow for a nuanced understanding of prevalence rates and reliabilities, prevalence rates still vary with age, suggesting consistency across measurements might change in line with changes in prevalence across age groups. Further analysis of how substance use patterns change within specific age groups and the ability to track them accurately would be valuable.

Factors Associated with Underreporting

Under- and overreporting are driven by different mechanisms, such as hair testing not being able to detect substances or metabolites (especially difficult for THC and amphetamine), misremembering the time frame of substance use, lack of knowledge about the substances (e.g., exposure to contaminated substances or receiving other substances than expected) and medications (e.g., not being able to distinguish opioid from non-opioid painkillers), or social desirability, stigma, and fear of consequences. Studies have demonstrated that, specifically, people from minority ethnicities [49, 50] and those having low antisocial behavior levels [49, 51] tend to underreport their substance use. Contrary to studies finding ethnicity effects for underreporting [27, 49], at age 24, we replicated the age 20 findings [5], supporting H3, showing no migration background effects in the models. Our longitudinal analyses also reinforce associations between underreporting and behavioral and psychological characteristics. The observed relationship between internalizing symptoms, delinquency, ADHD symptoms, and low self-control with underreporting of cocaine and codeine use may reflect a more consistent self-image that is in line with actual behavior, including substance use, or a general pattern of non-social behavior spanning from cocaine use to lower susceptibility to social desirability bias. This suggests that stable psychological and personal traits may impact the dynamics of substance use and its self-reporting, emphasizing the need to consider these characteristics in future studies. However, longitudinal investigations only show a stable pattern for higher delinquency and underreporting.

Limitations

Several limitations must be addressed. First, the binary classification of substance use might fail to capture possible nuances between occasional and regular consumption patterns. Second, the absence of MDMA at wave 8 for the 3-month self-reported use prevalence did not allow for direct comparisons. Third, direct comparisons were conducted using multiple tests for each substance, possibly inflating type I errors. Fourth, even though hair analysis is considered a “gold standard,” it still presents challenges, such as inaccurate detection of non-regular cannabis use and passive exposure to THC. Finally, demographic characteristics, such as gender and ethnicity, can influence the likelihood of agreement to provide hair samples [52].

Substance use prevalence among young adults in the Zurich region of Switzerland is high, and ideally, hair analyses and self-reports should be employed together as a multimethod approach to capture substance use. Although certain illegal substance use is often underreported, our findings indicate self-reported substance use is, for the most part, a dependable indicator of actual substance use. The consistent yet variable underreporting nevertheless highlights the importance of objectively assessing substance use, particularly for studies investigating the use of cocaine, Ecstasy/MDMA, ketamine, and codeine. Combining both methods is desirable for low- and high-risk samples to accurately determine various substances independent of age range.

The authors thank all participants, field managers, and the toxicology laboratory team for their invaluable contributions to the study’s success.

The study was reviewed and approved by the Cantonal Ethics Committee Zurich (BASEC #2017-02021) and the Ethics Committee of the Faculty of Arts and Social Sciences, University of Zurich, approval No. 2018.2.12 (phase 5) and approval No. 21.12.13 (phase 6). Written informed consent was obtained from participants to participate in the study.

The authors have no conflicts of interest to declare.

The Swiss National Science Foundation (SNSF) funded the study; grant #10531C_189008 was awarded to L.S. and grant #105314_214979 to B.B.Q. Furthermore, the z-proso study has been financially supported by the SNSF as a research infrastructure, grant #10F114_170409, awarded to Michael Shanahan, grant #10F114_198052, awarded to D.R. and L.S., by the Jacobs Center, and by the Jacobs Foundation (JF). In earlier stages of z-proso (2003–2016), the SNSF, JF, Swiss Federal Office of Public Health, Department of Education of the Canton of Zurich, Swiss State Secretariat of Migration and its predecessors, Julius Bär Foundation, and Visana Plus Foundation supported the study. Funding was provided for independent fundamental research.

C.J., L.E., A.S., L.S., and B.B.Q. initially conceptualized the study. C.J., L.E., A.S., L.B., D.R., M.R.B., and T.M.B. handled data curation. C.J., A.S., and L.B. took on the formal analysis. D.R., M.E., L.S., and B.B.Q. secured funding. D.R., M.E., M.R.B., T.M.B., L.S., and B.B.Q. were involved in the investigation. C.J., L.E., A.S., L.B., L.S., and B.B.Q. developed the research approach for methodology. L.S. and B.B.Q. managed project administration. C.J. and L.E. were responsible for data visualization. C.J. wrote the original draft and the manuscript’s review and editing saw contributions from L.E., A.S., L.J.F., L.B., M.L., D.R., M.E., M.R.B., T.M.B., L.S., and B.B.Q. All authors approved the final manuscript.

Additional Information

Lilly Shanahan and Boris B. Quednow contributed equally to this work.

The data that support the findings of this study are not publicly available due to sensitive personal information. However, researchers can request the data from the principal investigators (D.R.; [email protected]) of the z-proso project with a brief sketch of its planned use.

1.
United Nations Office on Drugs and Crime
.
World drug report 2023
.
Vienna
:
United Nations
;
2023
.
2.
National Institute on Drug Abuse (NIDA)
.
Common comorbidities with substance use disorders research report part 1: the connection between substance use disorders and mental illness
.
Bethesda, MD, USA
;
2021
.
3.
Barr
PB
,
Driver
MN
,
Kuo
SIC
,
Stephenson
M
,
Aliev
F
,
Linnér
RK
, et al
.
Clinical, environmental, and genetic risk factors for substance use disorders: characterizing combined effects across multiple cohorts
.
Mol Psychiatry
.
2022
;
27
(
11
):
4633
41
.
4.
Bharat
C
,
Webb
P
,
Wilkinson
Z
,
McKetin
R
,
Grebely
J
,
Farrell
M
, et al
.
Agreement between self‐reported illicit drug use and biological samples: a systematic review and meta‐analysis
.
Addiction
.
2023
;
118
(
9
):
1624
48
.
5.
Steinhoff
A
,
Shanahan
L
,
Bechtiger
L
,
Zimmermann
J
,
Ribeaud
D
,
Eisner
MP
, et al
.
When substance use is underreported: comparing self-reports and hair toxicology in an urban cohort of young adults
.
J Am Acad Child Adolesc Psychiatry
.
2023
;
62
(
7
):
791
804
.
6.
Palamar
JJ
,
Salomone
A
,
Keyes
KM
.
Underreporting of drug use among electronic dance music party attendees
.
Clin Toxicol
.
2021
;
59
(
3
):
185
92
.
7.
Palamar
JJ
,
Salomone
A
,
Vincenti
M
,
Cleland
CM
.
Detection of “bath salts” and other novel psychoactive substances in hair samples of ecstasy/MDMA/‘Molly’ users
.
Drug Alcohol Depend
.
2016
;
161
:
200
5
.
8.
Quednow
BB
,
Steinhoff
A
,
Bechtiger
L
,
Ribeaud
D
,
Eisner
M
,
Shanahan
L
.
High prevalence and early onsets: legal and illegal substance use in an urban cohort of young adults in Switzerland
.
Eur Addict Res
.
2022
;
28
(
3
):
186
98
.
9.
Degenhardt
L
,
Chiu
WT
,
Sampson
N
,
Kessler
RC
,
Anthony
JC
,
Angermeyer
M
, et al
.
Toward a global view of alcohol, tobacco, cannabis, and cocaine use: findings from the WHO world mental health surveys
.
PLoS Med
.
2008
;
5
(
7
):
e141
.
10.
Baggio
S
,
Studer
J
,
Mohler-Kuo
M
,
Daeppen
J
,
Gmel
G
.
Profiles of drug users in Switzerland and effects of early-onset intensive use of alcohol, tobacco and cannabis on other illicit drug use
.
Swiss Med Wkly
.
2013
;
143
:
w13805
.
11.
Delgrande Jordan
M
,
Schmidhauser
V
,
Balsiger
N
,
Delgrande Jordan
M
,
Schmidhauser
V
,
Balsiger
N
.
Santé et bien-être des 11 à 15 ans en Suisse–Situation en 2022, évolution dans le temps et corrélats–Résultats de l’étude Health Behaviour in School-aged Children (HBSC) (rapport de recherche No 159)
.
Addict Suisse
;
2023
.
12.
Cosma
A
,
Abdrakhmanova
S
,
Taut
D
,
Schrijvers
K
,
Catunda
C
,
Schnohr
C
.
A focus on adolescent mental health and well-being in Europe, central Asia and Canada. Health behaviour in school-aged children international report from the 2021/2022 survey
.
Copenhagen
:
WHO Regional Office for Europe
;
2023
.
Vol. 1
.
13.
Gmel
G
,
Kuendig
H
,
Notari
L
,
Gmel
C
.
Suchtmonitoring Schweiz - Konsum von Alkohol, Tabak und illegalen Drogen in der Schweiz im Jahr 2016
.
Lausanne
:
Sucht Schweiz
;
2017
.
14.
Obsan-Indikatoren
.
MonAM Schweizer Monitoring-System Sucht und nichtübertragbare Krankheiten
.
Obsan-Indikatoren
. Available from: https://ind.obsan.admin.ch/monam (Accessed: Mar 23, 204AD).
15.
Ribeaud
D
,
Loher
MT
.
Entwicklung von Gewalterfahrungen Jugendlicher im Kanton Zürich 1999-2021
;
2022
.
16.
Been
F
,
Bijlsma
L
,
Benaglia
L
,
Berset
JD
,
Botero-Coy
AM
,
Castiglioni
S
, et al
.
Assessing geographical differences in illicit drug consumption—a comparison of results from epidemiological and wastewater data in Germany and Switzerland
.
Drug Alcohol Depend
.
2016
;
161
:
189
99
.
17.
González-Mariño
I
,
Baz-Lomba
JA
,
Alygizakis
NA
,
Andrés-Costa
MJ
,
Bade
R
,
Bannwarth
A
, et al
.
Spatio-temporal assessment of illicit drug use at large scale: evidence from 7 years of international wastewater monitoring
.
Addiction
.
2020
;
115
(
1
):
109
20
.
18.
Towers
EB
,
Williams
IL
,
Qillawala
EI
,
Rissman
EF
,
Lynch
WJ
.
Sex/gender differences in the time-course for the development of substance use disorder: a focus on the telescoping effect
.
Pharmacol Rev
.
2023
;
75
(
2
):
217
49
.
19.
Broman
MJ
,
Bista
S
,
Broman
CL
.
Inconsistency in self-reporting the use of substances over time
.
Substance Use Misuse
.
2022
;
57
(
9
):
1356
64
.
20.
Johnson
T
,
Fendrich
M
.
Modeling sources of self-report bias in a survey of drug use epidemiology
.
Ann Epidemiol
.
2005
;
15
(
5
):
381
9
.
21.
Fendrich
M
,
Johnson
TP
,
Becker
J
.
The use of biological measures in social research on drug misuse
. In:
VanGeest
JB
,
Johnson
TP
,
Alemagno
SA
, editors.
Research methods in the study of substance abuse
.
Cham
:
Springer International Publishing
;
2017
. p.
285
313
.
22.
Fendrich
M
,
Johnson
TP
,
Wislar
JS
,
Hubbell
A
.
Drug test feasibility in a general population household survey
.
Drug Alcohol Depend
.
2004
;
73
(
3
):
237
50
.
23.
Palamar
JJ
,
Le
A
,
Guarino
H
,
Mateu-Gelabert
P
.
A comparison of the utility of urine- and hair testing in detecting self-reported drug use among young adult opioid users
.
Drug Alcohol Depend
.
2019
;
200
:
161
7
.
24.
Taylor
M
,
Lees
R
,
Henderson
G
,
Lingford-Hughes
A
,
Macleod
J
,
Sullivan
J
, et al
.
Comparison of cannabinoids in hair with self-reported cannabis consumption in heavy, light and non-cannabis users
.
Drug Alcohol Rev
.
2017
;
36
(
2
):
220
6
.
25.
Moosmann
B
,
Roth
N
,
Auwärter
V
.
Finding cannabinoids in hair does not prove cannabis consumption
.
Sci Rep
.
2015
;
5
(
1
):
14906
.
26.
Moosmann
B
,
Roth
N
,
Auwärter
V
.
Hair analysis for Δ9‐tetrahydrocannabinolic acid A (THCA-A) and Δ9-tetrahydrocannabinol (THC) after handling cannabis plant material
.
Drug Test Anal
.
2016
;
8
(
1
):
128
32
.
27.
Sharma
G
,
Oden
N
,
VanVeldhuisen
PC
,
Bogenschutz
MP
.
Hair analysis and its concordance with self-report for drug users presenting in emergency department
.
Drug Alcohol Depend
.
2016
;
167
:
149
55
.
28.
United Nations Office on Drugs and Crime
.
World drug report 2018, booklet 4, drugs and age: drugs and associated issues among young people and older people
.
Vienna
:
United Nations
;
2018
.
29.
Ribeaud
D
,
Murray
A
,
Shanahan
L
,
Shanahan
MJ
,
Eisner
M
.
Cohort profile: the Zurich project on the social development from childhood to adulthood (z-proso)
.
J Dev Life Course Criminol
.
2022
;
8
(
1
):
151
71
.
30.
Scholz
C
,
Cabalzar
J
,
Kraemer
T
,
Baumgartner
MR
.
A comprehensive multi-analyte method for hair analysis: substance-specific quantification ranges and tool for task-oriented data evaluation
.
J Anal Toxicol
.
2021
;
45
(
7
):
701
12
.
31.
Scholz
C
,
Madry
MM
,
Kraemer
T
,
Baumgartner
MR
.
LC–MS-MS analysis of δ9-THC, CBN and CBD in hair: investigation of artifacts
.
J Anal Toxicol
.
2022
;
46
(
5
):
504
11
.
32.
Tremblay
RE
,
Loeber
R
,
Gagnon
C
,
Charlebois
P
,
Larivée
S
,
LeBlanc
M
.
Disruptive boys with stable and unstable high fighting behavior patterns during junior elementary school
.
J Abnorm Child Psychol
.
1991
;
19
(
3
):
285
300
.
33.
Murray
AL
,
Obsuth
I
,
Eisner
M
,
Ribeaud
D
.
Evaluating longitudinal invariance in dimensions of mental health across adolescence: an analysis of the social behavior questionnaire
.
Assessment
.
2019
;
26
(
7
):
1234
45
.
34.
Ribeaud
D
,
Eisner
M
. Entwicklung von Gewalterfahrungen Jugendlicher im Kanton Zürich. Oberentfelden: Sauerländer. 2009. Available from: https://sjer.ch/article/view/5300 (Accessed: 13 Mar 2024).
35.
z-proso Project Team
.
Z-Proso handbook: instruments and procedures in the adolescent and young adult surveys (Age 11 to 24; Waves K4-K9)
;
2024
.
36.
Grasmick
HG
,
Tittle
CR
,
Bursik
RJ
,
Arneklev
BJ
.
Testing the core empirical implications of gottfredson and hirschi’s general theory of crime
.
J Res Crime Delinquency
.
1993
;
30
(
1
):
5
29
.
37.
Ganzeboom
HBG
,
De Graaf
PM
,
Treiman
DJ
.
A standard international socio-economic index of occupational status
.
Soc Sci Res
.
1992
;
21
(
1
):
1
56
.
38.
IBM Corp
.
IBM SPSS statistics for windows, version 29.0
.
Armonk, NY
:
IBM Corp
;
2022
.
39.
R CoreTeam
.
R: A language and environment for statistical computing
;
2021
. Available from: https://www.R-project.org/
40.
Wickham
H
.
ggplot2: elegant graphics for data analysis
. In:
Use R! Cham: springer international publishing
. 2nd ed.
Imprint
:
Springer
;
2016
.
41.
European Monitoring Centre for Drugs and Drug Addiction
.
Wastewater analysis and drugs—a European multi-city study
.
European union drugs agency
. Available from: https://www.emcdda.europa.eu/publications/html/pods/waste-water-analysis_en (Accessed: Mar 28, 2024).
42.
Bracone
A
.
Zweckentfremdete Anwendung von Codein in der Romandie
.
Prim Hosp Care
.
2024
.
43.
Addison
M
.
Framing stigma as an avoidable social harm that widens inequality
.
Sociol Rev
.
2023
;
71
(
2
):
296
314
.
44.
Solimini
R
,
Rotolo
MC
,
Pellegrini
M
,
Minutillo
A
,
Pacifici
R
,
Busardò
FP
, et al
.
Adulteration practices of psychoactive illicit drugs: an updated review
.
Chem Pharm Bull
.
2017
;
18
(
7
):
524
30
.
45.
S Gold
M
,
Lud Cadet
J
,
Baron
D
,
D Badgaiyan
R
,
Blum
K
.
Calvin klein (CK) designer cocktail, new “Speedball” is the “grimm reaper”: brain dopaminergic surge a potential death sentence
.
J Syst Integr Neurosci
.
2020
;
6
(
3
).
46.
Palamar
JJ
,
Salomone
A
,
Rutherford
C
,
Keyes
KM
.
Extensive underreported exposure to ketamine among electronic dance music party attendees
.
J Gen Intern Med
.
2021
;
36
(
1
):
235
7
.
47.
Waddell-Smith
RJH
.
A review of recent advances in impurity profiling of illicit MDMA samples
.
J Forensic Sci
.
2007
;
52
(
6
):
1297
304
.
48.
Grison
S
,
Johnson-Ferguson
L
,
Vonmoos
M
,
Baumgartner
MR
,
Quednow
BB
.
Associations between self-reported cocaine use patterns and cocaine and its metabolites in hair: Implications for clinical and forensic practices
.
Drug Test Anal. In
.
press
.
49.
Ledgerwood
DM
,
Goldberger
BA
,
Risk
NK
,
Lewis
CE
,
Kato Price
R
.
Comparison between self-report and hair analysis of illicit drug use in a community sample of middle-aged men
.
Addict Behaviors
.
2008
;
33
(
9
):
1131
9
.
50.
Fendrich
M
,
Johnson
TP
,
Wislar
JS
,
Hubbell
A
,
Spiehler
V
.
The utility of drug testing in epidemiological research: results from a general population survey
.
Addiction
.
2004
;
99
(
2
):
197
208
.
51.
Taylor
M
,
Sullivan
J
,
Ring
SM
,
Macleod
J
,
Hickman
M
.
Assessment of rates of recanting and hair testing as a biological measure of drug use in a general population sample of young people: recanting and hair testing in young people
.
Addiction
.
2017
;
112
(
3
):
477
85
.
52.
Won
NY
,
Jabot
B
,
Wang
A
,
Palamar
JJ
,
Cottler
LB
.
Willingness to provide a hair sample for drug testing: results from an anonymous multi-city intercept survey
.
Am J Drug Alcohol Abuse
.
2024
;
50
(
2
):
261
8
.