You are here:

Conflation of prediction and causality in the TB literature

Published on

Updated:

We fast-track accepted articles to ensure that important scientific research is shared as rapidly as possible and publish them as edited preprints, prior to publication in the IJTLD, IJTLD OPEN or PHA.

To subscribe to the IJTLD online – either as an Institute or as an Individual – please read our Subscription Information, or email subscription@theunion.org for further details.

Read the free PDF for more information

M.L. Romo,1 L. Barcellini,2 M.F. Franke,1 P.Y. Khan3,4

SUMMARY

BACKGROUND: Observational data can answer both predictive and etiologic research questions; however, the model-building approach and interpretation of results differ based on the research goal (i.e., prediction versus causal inference). Conflation occurs when aspects of the methodology and/or interpretation that are unique to prediction or etiology are combined or confused, potentially leading to biased results and erroneous conclusions.

METHODS: We conducted a rapid review using MEDLINE (2018–2023) of a subset of the observational TB literature: cohort studies among people with drug-resistant TB that considered HIV status an exposure of interest and reported on TB treatment outcomes. For each article, we assessed the research question, statistical approach, presentation of results, and discussion and interpretation of results.

RESULTS: Among the 40 articles included, 32 (80%) had evidence of conflation. The most common specific types of conflation were recommending or proposing interventions to modify exposures in a predictive study and having a causal interpretation of predictors, with both types frequently co-occurring.

CONCLUSION: Conflation between prediction and etiology was common, highlighting the importance of increasing awareness about it and its potential consequences. We propose simple steps on how TB and lung health researchers can avoid conflation, beginning with clearly defining the research question. 

 

Studies relevant to human health can broadly be classified as descriptive, predictive, or etiologic based on the questions they aim to answer (Table 1).1 Descriptive studies quantify and characterize health phenomena.2 Predictive studies identify factors that predict an outcome or develop a prediction model.3,4 Etiologic studies estimate the causal effect of an exposure on an outcome to inform interventions.5–7 Descriptive and predictive studies typically rely on observational data.2–4 For etiologic studies, experimental studies with random exposure assignment (i.e., randomized trials) to address confounding are considered the gold standard, but ethical considerations as well as feasibility and practicality have driven the advancement of methods for estimating causal effects from observational data.5 Studies might also have simultaneous but separate descriptive, predictive, and etiologic objectives.

The research question, model-building approaches, and interpretation of results are distinct for predictive and etiologic studies, as only the latter must consider the underlying causal structure of the exposure-outcome relationship (Table 1).5–7 The overarching goal of prediction is to identify who will develop (i.e., prognosis) or already has (i.e., diagnosis) an outcome, irrespective of causality. However, both predictive and etiologic studies can use the same observational data and similar statistical approaches, like multivariable regression. In this context, ‘conflation’ between prediction and etiology, i.e., combining or confusing aspects of methodology and/or interpretation, can occur.8 Conflation is especially relevant for observational data and is expected to be less of an issue with primary analyses of experimental studies, as randomized trials are specifically designed, analyzed, and reported to answer causal ‘what if’ types of research questions.

Conflation can potentially lead to biased results and erroneous conclusions, which may have implications for patient care and public health and the direction of a research field. If an etiologic study adjusts for variables that are solely predictive of the outcome rather than confounders (i.e., common causes of the exposure and outcome), variables on the causal pathway, such as colliders and mediators, might be included in the model and bias the effect estimate (e.g., risk difference/ratio). For example, incorrectly adjusting for the mediator of birth weight can bias the effect of maternal smoking on infant mortality towards the null.9 Misinterpretation of estimates from regression models in both etiologic and predictive studies can lead to erroneous conclusions. For example, estimates from predictive studies that were misinterpreted as causal contributed to the misconception of a protective effect of cigarette smoking on SARS-CoV-2 infection risk, which led to unproductive research into nicotine and possibly increased tobacco consumption during the COVID-19 pandemic.10

We examined conflation between prediction and etiology in the observational TB literature by conducting a review and assessment of a subset of articles on HIV and drug-resistant TB (DR-TB). Observational studies could conceivably explore HIV as a predictor or a cause of unfavorable DR-TB treatment outcomes. We assessed articles for conflation, and described the frequencies and types of conflation to provide practical recommendations for how the TB and lung health research community can avoid conflation between prediction and etiology.

METHODS

We conducted a rapid review of the observational HIV and DR-TB literature and applied interim reporting recommendations.11 The review was guided by a protocol that was developed a priori by two of the authors (LB and PYK) and is summarized as follows:

Eligible articles reported the findings from cohort studies among people with DR-TB and considered HIV status as the primary factor of interest and/or included it in multivariable analyses and reported on TB treatment outcomes. We searched MEDLINE for articles in the prior five years from the search date (03 July 2023), using Medical Subject Heading and free text terms related to our topic (Supplementary Data Table S1). We excluded articles that were not in English, not a complete research report, or if the full-text article was unavailable. A single reviewer (LB) conducted the search, and title, abstract, and full-text screening. Two reviewers (LB and MLR) independently assessed study characteristics relevant to predictive and etiologic studies and evidence of conflation in four domains: research question, statistical approach, presentation of results, and discussion and interpretation of results. For this assessment, we adapted a tool with signaling questions from a prior review focused on conflation (Supplementary Data Table S2).8 The signaling questions probed about specific aspects unique to predictive and etiologic studies in each domain. A response of yes, no, unclear, or not applicable was recorded for each. We also included ‘Table 2 Fallacy’ whereby coefficients for non-main effects in an etiologic study (e.g., confounders) are interpreted and/or presented, typically in the second table of a research paper.12 Although all regression coefficients from a predictive study should be presented and interpreted, this is not the case for etiologic studies, which are typically focused on a specific exposure and interpreting coefficients of confounders may lead to incorrect conclusions about their role in the causal pathway. Based on the responses to the signaling questions, each of the four domains was classified as predictive, etiologic, conflated, or unclear. Additionally, a domain could be classified as both predictive and etiologic (without conflation) if the study clearly had separate predictive and etiologic research questions and these were treated distinctly in any given domain. The two reviewers compared their findings and came to a consensus on any discordance. Any remaining disagreements were resolved by consulting two senior researchers (PYK, MFF).

Among the articles included, we computed frequencies of classifications for each domain and the article overall, and types of conflation. An article was classified as conflated if any one of the four domains was classified as conflated (i.e., containing both etiologic and predictive elements) or at least two domains had discordant classifications (e.g., the research question was predictive and the discussion and interpretation of results was etiologic). An article was classified as unclear if the research question was unclear and the criteria for conflation were not met or if the research question was both etiologic and predictive and subsequent domains were neither both nor conflated. Otherwise, an article could be classified as etiologic, predictive, or both.

RESULTS

Of 177 articles identified, 69 were excluded based on their title and abstract and 108 were sought for retrieval. Of these, we were unable to retrieve 7, and of the 101 full-text articles, 61 were excluded, leaving 40 articles included in our review (Supplementary Data Figure S1). Figure 1 provides classifications for the four domains and the summary classification for each article. Table 2 contains illustrative example text from selected articles from the review13–20 and how they were classified.

The research question domain was classified as predictive for 16 (40%), as etiologic for 6 (15%), as both predictive and etiologic for one (3%), and as unclear for 17 (43%). The statistical approach domain was classified as predictive for 27 (68%), as etiologic for 3 (8%), as unclear for 6 (15%), and as conflated for 4 (10%). The results presentation domain was classified as predictive for 11 (28%), as etiologic for 4 (10%), as unclear for 18 (45%), and as conflated for 7 (18%). The discussion and interpretation of results domain was classified as predictive for 2 (5%), as etiologic for 20 (50%), as unclear for 1 (3%), and as conflated for 17 (43%). When considering all four domains, 2 (5%) articles were classified as predictive, none as etiologic, 6 (15%) as unclear, and 32 (80%) as conflated. Of the 32 articles with evidence of conflation, 23 (72%) had at least one domain classified as conflated and the other 9 (28%) articles had discordance among the domains.

Specific types of conflation and their overlap (Figure 2) were examined for etiologic (A–D) and predictive studies (E–H). Of the 6 articles with a clearly etiologic research question, 3 (50%) had adjustment variables selected based on ability to predict the outcome (A). Of the 7 articles with an etiologic research question and/or statistical approach, none reported predictive performance (B), 6 (86%) had evidence of ‘Table 2 Fallacy’ (C), and 2 (29%) recommended risk stratification and/or application in individuals for diagnosis or prognostic purposes (D). Of the 16 articles with a clearly predictive research question, none reported selecting covariates based on the causal structure (E). Of the 30 articles with a predictive research question and/or statistical approach, 15 (50%) had a causal interpretation of predictors (F), 19 (63%) recommended or proposed interventions to modify exposures (G), and 4 (13%) described residual confounding as a limitation (H). There was substantial overlap between F and G, with 14 (93%) of 15 articles that had a causal interpretation of predictors also recommending or proposing interventions to modify exposures.

DISCUSSION

In the recent peer-reviewed observational literature on HIV and DR-TB treatment, conflation between prediction and etiology was common. Publications with a predictive research question and/or statistical approach frequently had a causal interpretation of results and also recommended interventions to modify exposures based on the results. Additionally, we found that the specification of research questions and reporting of results were frequently unclear with respect to etiology or prediction. Compared with a previous conflation review by Ramspek et al. ours had a narrower scope (HIV and DR-TB vs. six broad medical fields) and breadth (40 articles vs. 180).8 They classified 127 (71%) of studies as etiologic whereas only 7 (18%) in our search had an etiologic research question and/or statistical approach. We also identified a larger proportion of articles with conflation (80% vs. 26%). These differences could be due to their search focusing on top-ranked journals, which might favor etiologic studies for their greater perceived impact and have more stringent editorial policies and rigorous peer review procedures resulting in less conflation. We agree with their recommendations for researchers and suggest the following steps to avoid conflation that are informed by our review.

  1. Clearly define the research question(s) or objective(s) regarding description, etiology, and prediction

The research question should be clearly specified before undertaking the analysis, as analytic approaches can differ depending on the underlying goal. Effectively conveying the research question in the development of the manuscript provides the foundation for describing the statistical approach, reporting of results and the discussion, without conflating prediction and etiology. In our review, 43% of articles had an unclear research question, which was frequently related to authors stating that they examined ‘associations’ of different variables with outcomes. Since both etiologic and predictive studies examine associations between variables, clarification is needed to convey the goal of the study, which can be achieved by using more precise language. Some words that link an exposure with an outcome, like ‘cause’ or ‘prevent,’ have a less ambiguous interpretation than others.21 For an etiologic study, using a term like ‘causal effect’ makes the research goal clear (e.g., ‘what is the causal effect of HIV on mortality among people with rifampicin-resistant TB?’). Unfortunately, causal language is sometimes discouraged in scientific journals, particularly for observational studies,22 which may make it easier to conflate prediction and etiology. We encourage authors to justify their word choice if they receive opposition from reviewers and editors.

For a predictive study, authors should also use clear language about the research goal and provide context about a prognostic or diagnostic purpose, and if the goal is to identify predictive factors or develop a prediction model. For example, the question ‘what comorbidities, including HIV, predict mortality during treatment among people with rifampicin-resistant TB?’ clearly has the goal of identifying predictive factors and the description of the outcome and time horizon (i.e., mortality during treatment) implies a prognostic purpose.

  1. Analyze the data according to the research question or objective

In the 23 studies with a clear predictive and/or etiologic research question, 8 (35%) did not have a statistical approach that matched the research question. Similar statistical methods can be used for both etiologic and predictive studies (e.g., multivariable regression). However, model-building approaches differ, with etiologic studies adjusting for confounders and predictive studies including variables that predict the outcome without regard to the underlying causal structure. Researchers should specify an appropriate analysis plan before attempting to analyze data. Such a plan should be designed for the research question, for example, by including how confounding will be addressed for an etiologic question. To ensure appropriate use of methods, we recommend engaging a methodologist whenever possible, preferably at the inception of planning a study. The review by Ramspek et al. found that conflation was less frequent when an epidemiology department was listed in the author affiliations.8 Individuals with training in epidemiology and/or biostatistics are ideal to engage because these fields are focused on the application of data methods for scientific inquiry.23,24

  1. Report and interpret results according to the research question and methods used

The most common reasons for conflation related to reporting and interpretation of results. ‘Table 2 Fallacy’ and a causal interpretation of predictors were common among etiologic and predictive studies, respectively. A good practice to improve the quality of the reporting is to adhere to checklists like STROBE25 for observational studies with an etiologic objective and TRIPOD26 for prediction model studies, even if not explicitly required by a journal. Such checklists provide guidance to authors on what they should include in a manuscript for clearer reporting and therefore might make identifying conflation between etiology and prediction easier. However, these checklists do not recommend specific practices to avoid conflation. For example, STROBE instructs reporting unadjusted and confounder-adjusted estimates but does not explicitly state that authors should avoid reporting or interpreting coefficients from confounders for a study with an etiologic objective (i.e., ‘Table 2 Fallacy’). An important limitation is that neither TRIPOD nor STROBE is fully applicable to studies that aim to identify factors that are predictive of an outcome, which were common in our review. TRIPOD contains reporting items specific to the development, specification, and performance of prediction models, which is beyond the scope of solely identifying predictive factors. STROBE contains multiple references confounding and confounders in both the reporting of methods and results, which are not applicable to these predictive studies.

            As with defining the research question, authors must take care with the language used to interpret their findings and the conclusions they make but must also ensure that their interpretation coincides with their underlying research goal (e.g., avoid causal language when interpreting predictive associations). We found that a causal interpretation of predictors was common and frequently co-occurred with recommending or proposing interventions to modify exposures. Because factors identified as predictors might not be causal of the outcome, they could be inappropriate intervention targets. Conflating predictive results as causal might lead to recommendations or proposals for interventions targeting exposures that, even if effectively intervened upon, do not impact the outcome of interest. Many of the predictive articles in our review examined individual-level predictors of DR-TB treatment outcomes, including comorbidities such as HIV. Therefore, recommendations in the discussion should focus on risk stratification, i.e., identifying the characteristics of people at higher risk for an unfavorable outcome who might benefit from additional monitoring or support, rather than intervening on these characteristics.

Our review is subject to limitations. Some potentially eligible articles for inclusion may have been missed due to use of a rapid review methodology. However, our objective was to assess conflation between prediction and etiology within a subset of the TB literature, rather than to systematically review the evidence on the association between HIV and DR-TB treatment outcomes. Errors in interpretation may have led to some misclassification of domains and articles, but this was minimized by having two researchers independently conduct assessments, compare their findings and come to consensus, and resolve any discordances with senior researchers on the team. Furthermore, we restricted assessments to article text and avoided assumptions about the authors’ intent. Because our review focused on a subset of the TB literature, the generalizability of our findings to the broader TB literature is unknown, but we would not expect this to be an issue solely affecting articles related to HIV and DR-TB treatment. Our findings highlight the importance of increasing awareness of conflation between prediction and etiology in the broader TB and lung health research community.

Ending the world’s oldest and deadliest pandemic of TB requires leveraging the best science.27,28 As stated by Professor Doug Altman, ‘to maximize the benefit to society, you need to not just do research but do it well.’29 Achieving the best science means doing research well and reporting it well. Conflation between prediction and etiology in observational research presents an obstacle to this goal, as it can lead to biased results, erroneous conclusions, and steer a research field in an inefficient or wrong direction.8 We found that the avoidable phenomenon of conflation between prediction and etiology was common in a subset of the peer-reviewed TB literature, and provide practical recommendations on how to avoid it.

Acknowledgements
MLR and MFF were supported by grant from the National Institute of Allergy and Infectious Diseases of the US National Institutes of Health (R03AI180576, PI: Franke).

Conflicts of interest: none declared.

References

  1. Hernán MA, Hsu J, Healy B. A Second Chance to Get Causal Inference Right: A Classification of Data Science Tasks. CHANCE 2019;32(1):42-49.
  2. Lesko CR, Fox MP, Edwards JK. A Framework for Descriptive Epidemiology. Am J Epidemiol 2022;191(12):2063-2070.
  3. Riley RD, et al. Prognosis Research Strategy (PROGRESS) 2: prognostic factor research. PLoS Med. 2013;10(2):e1001380.
  4. Efthimiou O, et al. Developing clinical prediction models: a step-by-step guide. BMJ. 2024;386:e078276.
  5. Hernán MA, Robins JM (2020). Causal Inference: What If. Boca Raton: Chapman & Hall/CRC.
  6. Lederer DJ, et al. Control of Confounding and Reporting of Results in Causal Inference Studies. Guidance for Authors from Editors of Respiratory, Sleep, and Critical Care Journals. Ann Am Thorac Soc. 2019;16(1):22-28.
  7. Schooling CM, Jones HE. Clarifying questions about "risk factors": predictors versus explanation. Emerg Themes Epidemiol. 2018;15:10.
  8. Ramspek CL, et al. Prediction or causality? A scoping review of their conflation within current observational research. Eur J Epidemiol 2021;36(9):889-898.
  9. Hernández-Díaz S, Schisterman EF, Hernán MA. The birth weight "paradox" uncovered?. Am J Epidemiol. 2006;164(11):1115-1120.
  10. van Westen-Lagerweij NA, et al. Are smokers protected against SARS-CoV-2 infection (COVID-19)? The origins of the myth. NPJ Prim Care Respir Med. 2021;31(1):10.
  11. Stevens A, et al. Rapid review method series: interim guidance for the reporting of rapid reviews. BMJ Evid Based Med 2025;30:118-123.
  12. Westreich D, Greenland S. The table 2 fallacy: presenting and interpreting confounder and modifier coefficients. Am J Epidemiol 2013;177(4):292-8.
  13. Ndjeka N, et al. Treatment outcomes 24 months after initiating short, all-oral bedaquiline-containing or injectable-containing rifampicin-resistant tuberculosis treatment regimens in South Africa: a retrospective cohort study. Lancet Infect Dis 2022;22(7):1042-1051.
  14. Tobon A, et al. Adverse treatment outcomes in multidrug resistant tuberculosis go beyond the microbe-drug interaction: Results of a multiple correspondence analysis. Biomedica 2020;40(4):616-625.
  15. Tola HH, et al. Low enrollment and high treatment success in children with drug-resistant tuberculosis in Ethiopia: A ten years national retrospective cohort study. PLoS One 2020;15(2):e0229284.
  16. Kuaban A, et al. Incidence and factors associated with unfavourable treatment outcome among patients with rifampicin-resistant pulmonary tuberculosis in Yaounde, Cameroon. Pan Afr Med J 2021;38:229.
  17. Plesca V, et al. Do catastrophic costs impact treatment outcomes in people with rifampicin-resistant tuberculosis in the Republic of Moldova? Monaldi Arch Chest Dis 2021;91(1).
  18. Zhao Y, et al. Improved Treatment Outcomes With Bedaquiline When Substituted for Second-line Injectable Agents in Multidrug-resistant Tuberculosis: A Retrospective Cohort Study. Clin Infect Dis. 2019;68(9):1522-1529.
  19. Kintu TM, et al. Unfavorable treatment outcomes among patients with drug-resistant TB in Uganda. Int J Tuberc Lung Dis 2023;27(4):291-297.
  20. Safaev K, et al. Trends, Characteristics and Treatment Outcomes of Patients with Drug-Resistant Tuberculosis in Uzbekistan: 2013-2018. Int J Environ Res Public Health 2021;18(9):4663.
  21. Haber NA, et al. Causal and Associational Language in Observational Health Research: A Systematic Evaluation. Am J Epidemiol 2022;191(12):2084-2097.
  22. Hernan MA. The C-Word: Scientific Euphemisms Do Not Improve Causal Inference From Observational Data. Am J Public Health. 2018;108(5):616-619.
  23. Samet JM, Woodward A. On Being an Epidemiologist. Am J Epidemiol 2019;188(5):818-824.
  24. Zapf A, Rauch G, Kieser M. Why do you need a biostatistician?. BMC Med Res Methodol. 2020;20(1):23.
  25. Vandenbroucke JP, et al. Strengthening the Reporting of Observational Studies in Epidemiology (STROBE): explanation and elaboration. PLoS Med 2007;4(10):e297.
  26. Collins GS, et al. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ 2015;350:g7594.
  27. Kim PS, Swaminathan S. Ending TB: the world's oldest pandemic. J Int AIDS Soc 2021;24(3):e25698.
  28. Reid M, et al. Scientific advances and the end of tuberculosis: a report from the Lancet Commission on Tuberculosis. Lancet 2023;402(10411):1473-1498.
  29. Sauerbrei W, et al. Doug Altman: Driving critical appraisal and improvements in the quality of methodological and medical research. Biom J 2021;63(2):226-246.