"Artificial sweeteners such as aspartame and sucralose promote weight gain and metabolic disease."

health nutrition · generated 2026-04-01 · v1.3.1
PARTIALLY VERIFIED 4 citations
Evidence assessed across 4 verified citations.
Verified by Proof Engine — an open-source tool that verifies claims using cited sources and executable code. Reasoning transparent and auditable.
methodology · github · re-run this proof · submit your own

The association between artificial sweeteners and weight gain is real and well-documented — but the claim that sweeteners cause these outcomes does not hold up when put to rigorous experimental test.

What Was Claimed?

The claim is that sweeteners like aspartame and sucralose actively drive weight gain and metabolic problems such as diabetes and cardiovascular disease. This is a common concern, widely cited in health media, and has real stakes: hundreds of millions of people use these sweeteners daily as a supposedly safer alternative to sugar. If the sweeteners themselves are making things worse, that's a significant public health question.

What Did We Find?

The picture splits cleanly into two parts: what observational research shows, and what experimental research shows — and they point in different directions.

On the observational side, the evidence is consistent and substantial. A 2017 meta-analysis combining 30 long-term cohort studies and over 400,000 participants found that non-nutritive sweetener consumption was associated with higher rates of obesity, metabolic syndrome, type 2 diabetes, hypertension, and cardiovascular events. A separate 25-year study tracking over 3,000 American adults found that aspartame and saccharin consumers had greater risks of developing obesity and larger volumes of body fat over time. A third large survey found that aspartame consumers showed steeper impairments in blood sugar control as their weight increased. Three independent research groups, using different populations and methods, all found the same pattern.

But here is where the story complicates. When researchers run randomized controlled trials — the gold standard for establishing cause and effect — the picture reverses. Across multiple meta-analyses totaling hundreds of trials and over a thousand participants, artificial sweeteners show no weight gain, no worsening of blood sugar, no increase in metabolic disease markers. One analysis of 15 trials found that sweetener users actually lost slightly more weight on average. Another large analysis found that switching from sugary drinks to sweetened diet drinks produced effects comparable to switching to plain water.

The most likely explanation for the gap between observational and experimental findings is reverse causality: people who are already overweight or at metabolic risk are the ones most likely to choose diet products. This creates a real statistical association in population studies without any causal arrow running from sweetener to disease. The research teams themselves acknowledge this. The WHO's 2023 recommendation against sweeteners for weight control was issued at its weakest evidence grade specifically because this confound cannot be resolved with observational data alone.

One additional nuance: sucralose, specifically, showed no significant association with weight or fat tissue outcomes in the longest-running prospective study. The observational associations that do exist are primarily for aspartame and saccharin.

What Should You Keep In Mind?

The observational associations are real and not trivial — multiple large, well-conducted studies found them consistently. The possibility that some biological mechanism connects sweeteners to metabolic outcomes cannot be ruled out entirely. One study proposed that sweeteners may disrupt gut bacteria in ways that affect blood sugar regulation, though this was demonstrated primarily in mice and a very small human sample.

The experimental evidence contradicts causation as currently understood, but RCTs on this topic tend to be shorter in duration than the decades-long observational studies, and they may not capture all relevant outcomes. Absence of proof from trials is not the same as proof of absence.

The evidence also does not address all sweeteners equally, all doses, or all populations — children, people with existing metabolic conditions, and heavy long-term users may behave differently than the average trial participant.

How Was This Verified?

This claim was broken into two sub-claims — one about observed associations in human populations, one about established causation — and each was tested independently against the published literature. See the structured proof report for the full evidence summary and reasoning, the full verification audit for citation-level detail and adversarial checks, or re-run the proof yourself to reproduce the verdict from scratch.

What could challenge this verdict?

1. Do RCTs show that artificial sweeteners cause weight gain or metabolic disease?

Multiple meta-analyses of RCTs — the gold standard for establishing causation — show no weight gain and no metabolic harm: - Miller & Perez 2014 (AJCN, 15 RCTs): found a modest weight decrease (−0.80 kg; 95% CI: −1.17, −0.43) from low-calorie sweetener use. - McGlynn et al. 2022 (JAMA Network Open, network meta-analysis): found low/no-calorie sweetened beverages perform comparably to water when substituted for sugar-sweetened beverages. - Qin et al. 2025 (Frontiers in Nutrition, 9 RCTs, 1,457 participants): found no statistically significant differences in body weight, waist circumference, fasting blood glucose, HbA1c, insulin resistance, or blood pressure.

This is strong counter-evidence against SC2. It explains why SC2 fails: the experimental evidence that would establish causation actively contradicts the claimed direction of effect.

2. Is the observational association confounded by reverse causality?

Reverse causality — overweight or metabolically at-risk individuals preferentially choosing diet products — is the dominant alternative explanation for all observational associations. Azad et al. 2017 explicitly acknowledges this: "The cohort results may reflect confounding by indication, as people who are overweight or at risk of metabolic disease may choose nonnutritive sweeteners." The WHO 2023 guideline classifies its recommendation as "conditional" — the weakest guidance tier — precisely because the evidence is predominantly observational and subject to this confound.

3. Does the evidence apply equally to both aspartame AND sucralose?

The CARDIA 2023 study found that sucralose showed "all ptrend > 0.05" — no significant association with adipose tissue volumes or incident obesity — while aspartame and saccharin showed significant positive associations. The claim names both compounds, but the observational associations documented in SC1 are primarily for aspartame and saccharin, not sucralose specifically.

4. Does the WHO 2023 guideline establish causation?

The WHO 2023 guideline recommends against non-sugar sweeteners (NSS) for weight control but classifies this as a "conditional" (not "strong") recommendation. The WHO news page uses associational language: "potential undesirable effects from long-term use of NSS, such as an increased risk of type 2 diabetes, cardiovascular diseases, and mortality in adults." Harvard experts noted the WHO meta-analysis excluded large studies (>100,000 participants) showing beneficial substitution effects. The guideline does not establish causation.

Source: proof.py JSON summary adversarial_checks


Sources

SourceIDTypeVerified
Azad et al. 2017, CMAJ — Nonnutritive sweeteners and cardiometabolic health (PMC) B1 Government Yes
Steffen et al. 2023, International Journal of Obesity (CARDIA Study) — Long-term artificial sweetener intake and adiposity (PubMed) B2 Government Yes
Kuk & Brown 2016, Applied Physiology Nutrition and Metabolism — Aspartame intake and glucose tolerance in NHANES III (PubMed) B3 Government Yes
Suez et al. 2014, Nature — Artificial sweeteners induce glucose intolerance by altering the gut microbiota (PubMed) B4 Government Yes
SC1 confirmed source count A1 Computed
SC2 confirmed source count A2 Computed

detailed evidence

Detailed Evidence

Evidence Summary

ID Fact Verified
B1 Azad et al. 2017 CMAJ — meta-analysis (30 cohort studies, 405,907 participants) finding observational association with weight, waist circumference, obesity, T2D, metabolic syndrome, CVD Yes
B2 Steffen et al. 2023 Int J Obesity (CARDIA) — 25-year prospective cohort (N=3,088), aspartame and saccharin associated with adipose tissue volumes and incident obesity Yes
B3 Kuk & Brown 2016 APNM — NHANES III cross-sectional (N=2,856), aspartame associated with greater obesity-related glucose intolerance Yes
B4 Suez et al. 2014 Nature — gut-microbiome mechanism proposed for non-caloric artificial sweetener (NAS)-induced glucose intolerance (primarily mouse model, limited human data; insufficient for SC2 threshold) Yes
A1 SC1 confirmed source count Computed: 3 independent sources confirmed (threshold ≥ 3 — met)
A2 SC2 confirmed source count Computed: 1 independent source confirmed (threshold ≥ 3 — NOT met)

Source: proof.py JSON summary


Proof Logic

SC1 — Association with weight gain and metabolic disease

Three independent peer-reviewed sources confirm the association:

B1 (Azad et al. 2017, CMAJ): A systematic review and meta-analysis found that across 30 cohort studies with 405,907 participants and a median 10-year follow-up, "consumption of nonnutritive sweeteners was associated with increases in weight and waist circumference, and higher incidence of obesity, hypertension, metabolic syndrome, type 2 diabetes and cardiovascular events." This is the broadest and most comprehensive epidemiological synthesis available. The same review found that RCTs did not show clear benefit for weight management — a crucial divergence discussed under SC2.

B2 (Steffen et al. 2023, CARDIA Study): A 25-year prospective cohort study of 3,088 US adults found that "ArtSw, including diet soda, was associated with greater risks of incident obesity." Aspartame and saccharin (but not sucralose) were specifically associated with visceral, subcutaneous, and intermuscular adipose tissue volumes. This is the longest-running prospective data available on this question.

B3 (Kuk & Brown 2016, APNM/NHANES III): A cross-sectional analysis of 2,856 US adults found that "consumption of aspartame is associated with greater obesity-related impairments in glucose tolerance" — specifically, that aspartame consumers showed a steeper positive association between body mass index (BMI) and glucose intolerance than non-consumers. This documents a metabolic disease marker (impaired glucose tolerance) in addition to the weight associations above.

Three independent sources — different study designs, different populations, different research groups — all confirm the association. SC1 holds (3/3 confirmed, threshold ≥ 3).

SC2 — Causal relationship established

Only one source was identified that addresses the causal mechanism:

B4 (Suez et al. 2014, Nature): Proposes that "consumption of commonly used NAS formulations drives the development of glucose intolerance through induction of compositional and functional alterations to the intestinal microbiota." However, this study is primarily based on mouse experiments, with a very small human validation component (n=7 participants in a controlled intervention; n=381 in an observational component). It identifies a plausible mechanism but does not constitute causal evidence in representative human populations.

No RCTs, Mendelian randomization studies, or Bradford Hill analyses establishing human causation were found. SC2 fails (1/3 confirmed, threshold ≥ 3 — not met).

Compound evaluation

With SC1 holding and SC2 failing, the compound verdict is: 1 of 2 sub-claims hold → PARTIALLY VERIFIED.

Source: author analysis


Conclusion

Verdict: PARTIALLY VERIFIED

  • SC1 (association) — CONFIRMED: 3 independent sources (B1, B2, B3 — all fully verified) document statistically significant associations between artificial sweetener consumption and weight gain / metabolic disease outcomes in human populations. The association is consistent across a meta-analysis of 30 cohort studies, a 25-year prospective cohort, and a large cross-sectional survey.

  • SC2 (causation) — NOT ESTABLISHED: Only 1 source was identified for the causal sub-claim (a largely animal-based mechanistic study, B4), well below the threshold of 3. Multiple meta-analyses of RCTs — the highest-quality causal evidence — contradict the causal claim, showing neutral or modest beneficial effects on weight. The observational associations are almost certainly confounded by reverse causality. No human RCT, Mendelian randomization study, or equivalent causal analysis establishes that sweeteners cause weight gain or metabolic disease.

The claim's use of "promote" — implying causation — is not supported by the evidence. The more defensible statement is that artificial sweetener consumption is associated with weight gain and metabolic disease in observational data, but this association is not established as causal. The experimental literature points in the opposite direction for weight outcomes.

Additionally, the evidence for sucralose specifically is weaker than for aspartame, with the CARDIA 2023 study finding no significant adiposity associations for sucralose.

audit trail

Citation Verification 4/4 verified

All 4 citations verified.

Original audit log

B1 — Azad et al. 2017, CMAJ (PMC) - Status: verified - Method: full_quote - Fetch mode: live - Coverage: N/A (full quote match)

B2 — Steffen et al. 2023, CARDIA (PubMed) - Status: verified - Method: full_quote - Fetch mode: live - Coverage: N/A (full quote match) - Note: "ArtSw" is the abbreviation for artificial sweeteners as used in the abstract of the paper.

B3 — Kuk & Brown 2016, APNM (PubMed) - Status: verified - Method: full_quote - Fetch mode: live - Coverage: N/A (full quote match)

B4 — Suez et al. 2014, Nature (PubMed) - Status: verified - Method: full_quote - Fetch mode: live - Coverage: N/A (full quote match) - Note: "NAS" = non-caloric artificial sweeteners, as used in the paper's abstract. The study primarily examines saccharin (not aspartame or sucralose), with results validated in a small human sample (n=7 controlled intervention; n=381 observational). Even with a verified citation, this source is insufficient for SC2: it is a mechanistic/animal study and does not constitute a randomized controlled trial or equivalent causal inference method in humans.

Source: proof.py JSON summary


Claim Specification
Field Value
Subject artificial sweeteners, specifically aspartame and sucralose
Compound operator AND
Proof direction affirm
SC1 property associated with weight gain and metabolic disease in human observational studies
SC1 operator >=
SC1 threshold 3
SC1 operator_note SC-association: at least 3 independent peer-reviewed sources must document a statistically significant positive association between artificial sweetener consumption and weight gain or metabolic disease markers (obesity, T2D, cardiovascular disease, metabolic syndrome, impaired glucose tolerance) in human populations. Observational study designs (cohort, cross-sectional) are sufficient for SC1.
SC2 property causal relationship established via RCTs or equivalent causal inference methods in humans
SC2 operator >=
SC2 threshold 3
SC2 operator_note SC-causation: the word 'promote' implies causation. At least 3 independent sources must establish causation using RCTs, Mendelian randomization, Bradford Hill criteria, or equivalent methods. Observational sources do not satisfy SC2. Animal mechanistic studies do not satisfy SC2 without confirming human experimental data.
Compound operator_note Both sub-claims must hold for PROVED. Reverse causality — overweight/at-risk individuals preferentially choosing diet products — cannot be eliminated without experimental design. If only SC1 holds: PARTIALLY VERIFIED.

Source: proof.py JSON summary claim_formal


Claim Interpretation

Natural-language claim: "Artificial sweeteners such as aspartame and sucralose promote weight gain and metabolic disease."

The word "promote" in the claim implies causation — not mere correlation. A causal claim requires evidence that sweeteners actively cause weight gain and metabolic disease, as opposed to merely being observed alongside it. This claim is therefore decomposed into two sub-claims, both of which must hold for the compound claim to be PROVED:

  • SC1 (SC-association): Artificial sweetener consumption is associated with weight gain and metabolic disease in human populations. Satisfied by observational study designs (cohort studies, cross-sectional surveys) documenting statistically significant associations. Threshold: ≥ 3 independent verified sources.

  • SC2 (SC-causation): The association is causal — not explained by reverse causality or other confounding. Satisfied only by randomized controlled trials, Mendelian randomization studies, Bradford Hill criteria assessments, or equivalent causal inference methods in humans. Animal models and proposed mechanisms are insufficient without confirming human experimental data. Threshold: ≥ 3 independent verified sources.

Purely observational evidence does not satisfy SC2, regardless of sample size, because the dominant confound — people who are already overweight choosing diet products — cannot be eliminated without experimental design. If only SC1 holds, the verdict is PARTIALLY VERIFIED.

Source: proof.py JSON summary claim_formal and claim_natural


Source Credibility Assessment
Fact ID Domain Type Tier Note
B1 nih.gov (PMC) government 5 US National Library of Medicine database; paper published in CMAJ (peer-reviewed)
B2 nih.gov (PubMed) government 5 US National Library of Medicine database; paper published in Int J Obesity (peer-reviewed)
B3 nih.gov (PubMed) government 5 US National Library of Medicine database; paper published in APNM (peer-reviewed)
B4 nih.gov (PubMed) government 5 US National Library of Medicine database; paper published in Nature (peer-reviewed)

All citations are accessed via US government academic databases (PubMed/PMC). All underlying papers are published in peer-reviewed journals. No sources are flagged as low-credibility.

Source: proof.py JSON summary citations[].credibility


Computation Traces
SC1 confirmed sources: 3 / 3
SC2 confirmed sources: 1 / 1
SC1: association with weight gain and metabolic disease: 3 >= 3 = True
SC2: causation established via RCTs or causal inference: 1 >= 3 = False
compound: all sub-claims hold: 1 == 2 = False

Source: proof.py inline output (execution trace)


Independent Source Agreement

SC1 — Association sources

Check Value
Sources consulted 3
Sources verified 3
sc1_source_a (Azad 2017) verified
sc1_source_b (Steffen 2023) verified
sc1_source_c (Kuk 2016) verified
Independence note Sources are from different research groups, institutions, and study designs: Azad 2017 (Canadian systematic review/meta-analysis of 30 international cohort studies), Steffen 2023 (US CARDIA longitudinal cohort), Kuk 2016 (US NHANES cross-sectional survey). Different populations, methods, and outcome measures provide genuine independence.

SC2 — Causation sources

Check Value
Sources consulted 1
Sources verified 1
sc2_source_a (Suez 2014) verified
Independence note Only 1 SC2 source was identified. It proposes a gut-microbiome mechanism but is primarily mouse-model data with a very small human validation (n=7 intervention, n=381 observational). No RCTs, Mendelian randomization studies, or Bradford Hill analyses establishing causation in human populations were found. Multiple RCT meta-analyses (Miller 2014, McGlynn 2022, Qin 2025) actively contradict the causal claim. The Rule 6 warning for SC2 having only 1 source is intentional: there genuinely are no additional qualifying causal sources, and the experimental evidence contradicts causation.

Source: proof.py JSON summary cross_checks


Adversarial Checks

Check 1: Do RCTs show that artificial sweeteners cause weight gain or metabolic disease? - Verification performed: Searched "artificial sweeteners RCT weight gain randomized controlled trial meta-analysis". Reviewed Miller & Perez 2014 (AJCN, 15 RCTs), McGlynn et al. 2022 (JAMA Network Open, network meta-analysis), Qin et al. 2025 (Frontiers in Nutrition, 9 RCTs, 1,457 participants), and Toews et al. 2019 (BMJ, pre-specified WHO systematic review). - Finding: Multiple RCT meta-analyses show no weight gain and no metabolic harm. Miller & Perez 2014 found a modest weight DECREASE (−0.80 kg; 95% CI: −1.17, −0.43) across 15 RCTs. McGlynn et al. 2022 found low/no-calorie sweetened beverages comparable to water. Qin et al. 2025 found no significant differences in any metabolic marker across 9 RCTs. This is strong counter-evidence against SC2 and is why it fails. - Breaks proof: No — SC1 (observational association) is logically independent of SC2 and still holds. The RCT evidence informs the PARTIALLY VERIFIED verdict correctly.

Check 2: Is the observational association confounded by reverse causality? - Verification performed: Searched "artificial sweeteners reverse causality confounding observational studies". Reviewed Azad et al. 2017 CMAJ limitations section, WHO 2023 guideline evidence grade, and Toews et al. 2019 BMJ on confounding. - Finding: Reverse causality is the dominant competing explanation. Azad et al. 2017 acknowledges "The cohort results may reflect confounding by indication, as people who are overweight or at risk of metabolic disease may choose nonnutritive sweeteners." The WHO 2023 guideline is "conditional" specifically because of this limitation. - Breaks proof: No — this confirms SC2 fails (causation not established), consistent with PARTIALLY VERIFIED.

Check 3: Does the evidence apply equally to both aspartame AND sucralose? - Verification performed: Searched "sucralose weight gain adiposity evidence" and reviewed Steffen et al. 2023 (CARDIA) per-sweetener results for sucralose. - Finding: CARDIA 2023 found sucralose showed "all ptrend > 0.05" — no significant association with adipose tissue or obesity — while aspartame and saccharin showed significant associations. The SC1 evidence is primarily for aspartame, not sucralose. - Breaks proof: No — SC1 treats the sweetener class overall (multiple compounds across multiple studies), and the class-level association holds. However, this weakens the claim's specificity for sucralose.

Check 4: Does the WHO 2023 guideline establish causation? - Verification performed: Reviewed the WHO May 2023 news release directly. Checked evidence grade. Reviewed Harvard T.H. Chan School commentary (June 2023). - Finding: The WHO 2023 guideline is "conditional" (weakest guidance tier) due to predominantly observational evidence. The WHO statement uses associational language ("potential undesirable effects"). Harvard experts noted the WHO meta-analysis excluded large studies (>100,000 participants) showing beneficial substitution effects. The guideline does not establish causation. - Breaks proof: No — consistent with PARTIALLY VERIFIED.

Source: proof.py JSON summary adversarial_checks


Quality Checks
Rule Status Notes
Rule 1: Every empirical value parsed from quote text, not hand-typed N/A Qualitative proof — no numeric values extracted from quotes. Citation verification status used as the counting mechanism.
Rule 2: Every citation URL fetched and quote checked PASS All 4 citations verified via verify_all_citations(). Status: B1=verified, B2=verified, B3=verified, B4=verified. All via live fetch.
Rule 3: System time used for date-dependent logic PASS date.today() present in proof.py. Proof is not time-sensitive but the generator timestamp uses the system date.
Rule 4: Claim interpretation explicit with operator rationale PASS CLAIM_FORMAL dict with operator_note at compound level and per sub-claim. Causal decomposition into SC1 and SC2 is explicitly documented.
Rule 5: Adversarial checks searched for independent counter-evidence PASS 4 adversarial checks: RCT meta-analyses contradicting causation, reverse causality confounding, sucralose-specific evidence, WHO guideline evidence grade.
Rule 6: Cross-checks used independently sourced inputs PASS (with warning) SC1 has 3 sources from different research groups and study designs. SC2 has 1 source — intentional, reflecting genuine absence of human causal evidence; documented in cross_checks independence_note.
Rule 7: Constants and formulas imported from computations.py, not hand-coded N/A No numeric constants or formulas. compare() from computations.py used for all evaluations.
validate_proof.py result PASS with 2 warnings 17/19 checks passed. 0 issues. 2 warnings: both are Rule 6 warnings for SC2 having only 1 source — intentional and documented.

Source: proof.py inline output + author analysis

Source Data

For qualitative/consensus proofs, each B-type fact records citation verification status rather than extracted numeric values.

Fact ID Value (citation status) Value in Quote (countable) Quote Snippet
B1 verified True "consumption of nonnutritive sweeteners was associated with increases in weight a"
B2 verified True "ArtSw, including diet soda, was associated with greater risks of incident obesit"
B3 verified True "consumption of aspartame is associated with greater obesity-related impairments "
B4 verified True "consumption of commonly used NAS formulations drives the development of glucose "

Source: proof.py JSON summary extractions


Cite this proof
Proof Engine. (2026). Claim Verification: “Artificial sweeteners such as aspartame and sucralose promote weight gain and metabolic disease.” — Partially verified. https://doi.org/10.5281/zenodo.19455606
Proof Engine. "Claim Verification: “Artificial sweeteners such as aspartame and sucralose promote weight gain and metabolic disease.” — Partially verified." 2026. https://doi.org/10.5281/zenodo.19455606.
@misc{proofengine_artificial_sweeteners_such_as_aspartame_and_sucralose_promote_weight_gain_and,
  title   = {Claim Verification: “Artificial sweeteners such as aspartame and sucralose promote weight gain and metabolic disease.” — Partially verified},
  author  = {{Proof Engine}},
  year    = {2026},
  url     = {https://proofengine.info/proofs/artificial-sweeteners-such-as-aspartame-and-sucralose-promote-weight-gain-and/},
  note    = {Verdict: PARTIALLY VERIFIED. Generated by proof-engine v1.3.1},
  doi     = {10.5281/zenodo.19455606},
}
TY  - DATA
TI  - Claim Verification: “Artificial sweeteners such as aspartame and sucralose promote weight gain and metabolic disease.” — Partially verified
AU  - Proof Engine
PY  - 2026
UR  - https://proofengine.info/proofs/artificial-sweeteners-such-as-aspartame-and-sucralose-promote-weight-gain-and/
N1  - Verdict: PARTIALLY VERIFIED. Generated by proof-engine v1.3.1
DO  - 10.5281/zenodo.19455606
ER  -
View proof source 411 lines · 18.7 KB

This is the exact proof.py that was deposited to Zenodo and runs when you re-execute via Binder. Every fact in the verdict above traces to code below.

"""
Proof: Artificial sweeteners such as aspartame and sucralose promote weight gain and metabolic disease.
Generated: 2026-04-01

Claim type: Causal compound claim (SC-association + SC-causation).
"Promote" implies causation, not mere correlation. Decomposed per skill instructions.
"""
import json
import os
import sys

PROOF_ENGINE_ROOT = os.environ.get("PROOF_ENGINE_ROOT")
if not PROOF_ENGINE_ROOT:
    _d = os.path.dirname(os.path.abspath(__file__))
    while _d != os.path.dirname(_d):
        if os.path.isdir(os.path.join(_d, "proof-engine", "skills", "proof-engine", "scripts")):
            PROOF_ENGINE_ROOT = os.path.join(_d, "proof-engine", "skills", "proof-engine")
            break
        _d = os.path.dirname(_d)
    if not PROOF_ENGINE_ROOT:
        raise RuntimeError("PROOF_ENGINE_ROOT not set and skill dir not found via walk-up from proof.py")
sys.path.insert(0, PROOF_ENGINE_ROOT)

from datetime import date

from scripts.verify_citations import verify_all_citations, build_citation_detail
from scripts.computations import compare

# 1. CLAIM INTERPRETATION (Rule 4)
CLAIM_NATURAL = (
    "Artificial sweeteners such as aspartame and sucralose promote weight gain "
    "and metabolic disease."
)
CLAIM_FORMAL = {
    "subject": "artificial sweeteners, specifically aspartame and sucralose",
    "sub_claims": [
        {
            "id": "SC1",
            "property": (
                "associated with weight gain and metabolic disease "
                "in human observational studies"
            ),
            "operator": ">=",
            "threshold": 3,
            "operator_note": (
                "SC-association: at least 3 independent peer-reviewed sources must document "
                "a statistically significant positive association between artificial sweetener "
                "consumption and weight gain or metabolic disease markers (obesity, T2D, "
                "cardiovascular disease, metabolic syndrome, impaired glucose tolerance) "
                "in human populations. Observational study designs (cohort, cross-sectional) "
                "are sufficient for SC1."
            ),
        },
        {
            "id": "SC2",
            "property": (
                "causal relationship established via RCTs or equivalent causal "
                "inference methods in humans"
            ),
            "operator": ">=",
            "threshold": 3,
            "operator_note": (
                "SC-causation: the word 'promote' in the claim implies causation, not mere "
                "correlation. At least 3 independent sources must establish a causal (not "
                "merely associational) relationship using RCTs, Mendelian randomization, "
                "Bradford Hill criteria assessment, or equivalent causal inference methods. "
                "Purely observational sources do not satisfy SC2 regardless of sample size. "
                "Mechanistic evidence from animal studies does not satisfy SC2 in the absence "
                "of confirming human RCT or causal inference data."
            ),
        },
    ],
    "compound_operator": "AND",
    "proof_direction": "affirm",
    "operator_note": (
        "Both sub-claims must hold for the compound claim to be PROVED. "
        "SC1 (association) and SC2 (causation) are logically independent: an association "
        "can hold in observational data without establishing causation. The dominant "
        "confound in observational studies is reverse causality — people who are already "
        "overweight or at metabolic risk preferentially choose diet/low-calorie products, "
        "producing a spurious association without any causal mechanism operating from "
        "sweetener to disease. If only SC1 holds, the verdict is PARTIALLY VERIFIED "
        "with the notation that association is documented but causation is not established."
    ),
}

# 2. FACT REGISTRY
FACT_REGISTRY = {
    "B1": {
        "key": "sc1_source_a",
        "label": (
            "Azad et al. 2017 CMAJ — systematic review/meta-analysis finding observational "
            "association between NNS consumption and weight gain plus metabolic outcomes "
            "across 30 cohort studies (405,907 participants)"
        ),
    },
    "B2": {
        "key": "sc1_source_b",
        "label": (
            "Steffen et al. 2023 Int J Obesity (CARDIA Study) — 25-year prospective cohort "
            "(N=3,088), aspartame and saccharin positively associated with adipose tissue "
            "volumes and incident obesity"
        ),
    },
    "B3": {
        "key": "sc1_source_c",
        "label": (
            "Kuk & Brown 2016 APNM — NHANES III cross-sectional study (N=2,856), "
            "aspartame consumption associated with greater obesity-related glucose intolerance"
        ),
    },
    "B4": {
        "key": "sc2_source_a",
        "label": (
            "Suez et al. 2014 Nature — proposed gut-microbiome mechanism for NAS-induced "
            "glucose intolerance (primarily mouse model, limited human data; "
            "insufficient for SC2 causal threshold)"
        ),
    },
    "A1": {"label": "SC1 confirmed source count", "method": None, "result": None},
    "A2": {"label": "SC2 confirmed source count", "method": None, "result": None},
}

# 3. EMPIRICAL FACTS — grouped by sub-claim
empirical_facts = {
    # SC1 sources: observational evidence of association
    "sc1_source_a": {
        "quote": (
            "consumption of nonnutritive sweeteners was associated with increases in weight "
            "and waist circumference, and higher incidence of obesity, hypertension, metabolic "
            "syndrome, type 2 diabetes and cardiovascular events"
        ),
        "url": "https://pmc.ncbi.nlm.nih.gov/articles/PMC5515645/",
        "source_name": "Azad et al. 2017, CMAJ — Nonnutritive sweeteners and cardiometabolic health (PMC)",
    },
    "sc1_source_b": {
        "quote": (
            "ArtSw, including diet soda, was associated with greater risks of incident obesity"
        ),
        "url": "https://pubmed.ncbi.nlm.nih.gov/37443272/",
        "source_name": (
            "Steffen et al. 2023, International Journal of Obesity (CARDIA Study) — "
            "Long-term artificial sweetener intake and adiposity (PubMed)"
        ),
    },
    "sc1_source_c": {
        "quote": (
            "consumption of aspartame is associated with greater obesity-related impairments "
            "in glucose tolerance"
        ),
        "url": "https://pubmed.ncbi.nlm.nih.gov/27216413/",
        "source_name": (
            "Kuk & Brown 2016, Applied Physiology Nutrition and Metabolism — "
            "Aspartame intake and glucose tolerance in NHANES III (PubMed)"
        ),
    },
    # SC2 source: proposed causal mechanism — insufficient on its own to establish causation in humans
    "sc2_source_a": {
        "quote": (
            "consumption of commonly used NAS formulations drives the development of glucose "
            "intolerance through induction of compositional and functional alterations to the "
            "intestinal microbiota"
        ),
        "url": "https://pubmed.ncbi.nlm.nih.gov/25231862/",
        "source_name": (
            "Suez et al. 2014, Nature — Artificial sweeteners induce glucose intolerance by "
            "altering the gut microbiota (PubMed)"
        ),
    },
}

# 4. CITATION VERIFICATION (Rule 2)
citation_results = verify_all_citations(empirical_facts, wayback_fallback=True)

# 5. COUNT VERIFIED SOURCES PER SUB-CLAIM
COUNTABLE_STATUSES = ("verified", "partial")
sc1_keys = [k for k in empirical_facts if k.startswith("sc1_")]
sc2_keys = [k for k in empirical_facts if k.startswith("sc2_")]

n_sc1 = sum(1 for k in sc1_keys if citation_results[k]["status"] in COUNTABLE_STATUSES)
n_sc2 = sum(1 for k in sc2_keys if citation_results[k]["status"] in COUNTABLE_STATUSES)

print(f"  SC1 confirmed sources: {n_sc1} / {len(sc1_keys)}")
print(f"  SC2 confirmed sources: {n_sc2} / {len(sc2_keys)}")

# 6. PER-SUB-CLAIM EVALUATION — each uses compare()
sc1_holds = compare(
    n_sc1, ">=", CLAIM_FORMAL["sub_claims"][0]["threshold"],
    label="SC1: association with weight gain and metabolic disease"
)
sc2_holds = compare(
    n_sc2, ">=", CLAIM_FORMAL["sub_claims"][1]["threshold"],
    label="SC2: causation established via RCTs or causal inference"
)

# 7. COMPOUND EVALUATION
n_holding = sum([sc1_holds, sc2_holds])
n_total = len(CLAIM_FORMAL["sub_claims"])
claim_holds = compare(n_holding, "==", n_total, label="compound: all sub-claims hold")

# 8. ADVERSARIAL CHECKS (Rule 5)
adversarial_checks = [
    {
        "question": (
            "Do RCTs show that artificial sweeteners CAUSE weight gain or metabolic disease?"
        ),
        "verification_performed": (
            "Searched 'artificial sweeteners RCT weight gain randomized controlled trial meta-analysis'. "
            "Reviewed Miller & Perez 2014 (AJCN, 15 RCTs), McGlynn et al. 2022 (JAMA Network Open, "
            "network meta-analysis), Qin et al. 2025 (Frontiers in Nutrition, 9 RCTs, 1,457 participants), "
            "and Toews et al. 2019 (BMJ, pre-specified WHO systematic review)."
        ),
        "finding": (
            "Multiple meta-analyses of RCTs — the gold standard for causal inference — show no weight "
            "gain and no metabolic harm from artificial sweeteners: Miller & Perez 2014 found a modest "
            "weight DECREASE (−0.80 kg; 95% CI: −1.17, −0.43) from low-calorie sweetener use across "
            "15 RCTs; McGlynn et al. 2022 found low/no-calorie sweetened beverages perform comparably "
            "to water when substituted for sugar-sweetened beverages; Qin et al. 2025 found no "
            "statistically significant differences in body weight, waist circumference, fasting blood "
            "glucose, HbA1c, insulin resistance, or blood pressure across 9 RCTs. This is strong "
            "counter-evidence against SC2 (causation) and is why SC2 fails to meet its threshold. "
            "SC1 (observational association) still holds independently."
        ),
        "breaks_proof": False,
    },
    {
        "question": (
            "Is the observational association confounded by reverse causality "
            "(overweight/at-risk people choosing diet products)?"
        ),
        "verification_performed": (
            "Searched 'artificial sweeteners reverse causality confounding observational studies'. "
            "Reviewed Azad et al. 2017 CMAJ limitations section, WHO 2023 guideline evidence grade, "
            "and Toews et al. 2019 BMJ on confounding."
        ),
        "finding": (
            "Reverse causality is the dominant competing explanation for all observational associations. "
            "Azad et al. 2017 explicitly acknowledges: 'The cohort results may reflect confounding by "
            "indication, as people who are overweight or at risk of metabolic disease may choose "
            "nonnutritive sweeteners.' The WHO 2023 guideline classifies its recommendation as "
            "'conditional' — the weakest WHO guidance tier — specifically because the evidence is "
            "predominantly observational and subject to this confound. This confirms why SC1 "
            "(association documented) does not imply SC2 (causation established), and is consistent "
            "with the PARTIALLY VERIFIED verdict."
        ),
        "breaks_proof": False,
    },
    {
        "question": (
            "Does the evidence apply equally to both aspartame AND sucralose as named in the claim?"
        ),
        "verification_performed": (
            "Searched 'sucralose weight gain adiposity evidence' and reviewed Steffen et al. 2023 "
            "(CARDIA) per-sweetener results for sucralose specifically."
        ),
        "finding": (
            "The CARDIA 2023 study found that sucralose showed 'all ptrend > 0.05' — no significant "
            "association with adipose tissue volumes or incident obesity — while aspartame and saccharin "
            "showed significant positive associations. The observational association documented in SC1 "
            "is primarily driven by aspartame and saccharin, not sucralose. The claim names both "
            "aspartame and sucralose, but SC1 as verified is stronger for aspartame than sucralose. "
            "SC2 (causation) is not established for either compound. This weakens the claim's "
            "specificity for sucralose but does not break SC1 overall, which treats the class level "
            "association as confirmed by multiple independent studies."
        ),
        "breaks_proof": False,
    },
    {
        "question": (
            "Does the WHO 2023 guideline establish that sweeteners CAUSE weight gain?"
        ),
        "verification_performed": (
            "Reviewed the WHO May 2023 news release directly. Checked the evidence grade assigned. "
            "Also reviewed Harvard T.H. Chan School commentary (June 2023) on the WHO evidence review."
        ),
        "finding": (
            "The WHO 2023 guideline recommends against NSS use for weight control but explicitly "
            "classifies this as a 'conditional' — not 'strong' — recommendation due to the "
            "predominantly observational evidence base. The WHO news page states: 'potential "
            "undesirable effects from long-term use of NSS, such as an increased risk of type 2 "
            "diabetes, cardiovascular diseases, and mortality in adults' — which is associational "
            "language, not causal. Harvard experts additionally noted that the WHO meta-analysis "
            "excluded large studies (>100,000 participants) showing beneficial substitution effects. "
            "The guideline does not establish causation and does not contradict the PARTIALLY "
            "VERIFIED verdict."
        ),
        "breaks_proof": False,
    },
]

# 9. VERDICT AND STRUCTURED OUTPUT
if __name__ == "__main__":
    any_unverified = any(
        cr["status"] != "verified" for cr in citation_results.values()
    )
    any_breaks = any(ac.get("breaks_proof") for ac in adversarial_checks)
    is_disproof = CLAIM_FORMAL.get("proof_direction") == "disprove"
    uncertainty_override = False

    if any_breaks or uncertainty_override:
        verdict = "UNDETERMINED"
    elif not claim_holds and n_holding > 0:
        # Mixed: SC1 holds, SC2 does not — partially verified
        verdict = "PARTIALLY VERIFIED"
    elif claim_holds and not any_unverified:
        verdict = "DISPROVED" if is_disproof else "PROVED"
    elif claim_holds and any_unverified:
        verdict = (
            "DISPROVED (with unverified citations)"
            if is_disproof
            else "PROVED (with unverified citations)"
        )
    elif not claim_holds and n_holding == 0:
        verdict = "UNDETERMINED"
    else:
        verdict = "UNDETERMINED"

    FACT_REGISTRY["A1"]["method"] = f"count(verified sc1 citations) = {n_sc1}"
    FACT_REGISTRY["A1"]["result"] = str(n_sc1)
    FACT_REGISTRY["A2"]["method"] = f"count(verified sc2 citations) = {n_sc2}"
    FACT_REGISTRY["A2"]["result"] = str(n_sc2)

    citation_detail = build_citation_detail(FACT_REGISTRY, citation_results, empirical_facts)

    # Extractions: for qualitative/consensus proofs, each B-type fact records citation status
    extractions = {}
    for fid, info in FACT_REGISTRY.items():
        if not fid.startswith("B"):
            continue
        ef_key = info["key"]
        cr = citation_results.get(ef_key, {})
        extractions[fid] = {
            "value": cr.get("status", "unknown"),
            "value_in_quote": cr.get("status") in COUNTABLE_STATUSES,
            "quote_snippet": empirical_facts[ef_key]["quote"][:80],
        }

    summary = {
        "fact_registry": {
            fid: {k: v for k, v in info.items()}
            for fid, info in FACT_REGISTRY.items()
        },
        "claim_formal": CLAIM_FORMAL,
        "claim_natural": CLAIM_NATURAL,
        "citations": citation_detail,
        "extractions": extractions,
        "cross_checks": [
            {
                "description": "SC1: independent sources consulted for association evidence",
                "n_sources_consulted": len(sc1_keys),
                "n_sources_verified": n_sc1,
                "sources": {k: citation_results[k]["status"] for k in sc1_keys},
                "independence_note": (
                    "Sources are from different research groups, institutions, and study designs: "
                    "Azad 2017 (Canadian meta-analysis), Steffen 2023 (US CARDIA cohort), "
                    "Kuk 2016 (US NHANES cross-sectional). Different populations, methods, "
                    "and outcome measures."
                ),
            },
            {
                "description": "SC2: independent sources consulted for causal evidence",
                "n_sources_consulted": len(sc2_keys),
                "n_sources_verified": n_sc2,
                "sources": {k: citation_results[k]["status"] for k in sc2_keys},
                "independence_note": (
                    "Only 1 SC2 source was identified (Suez 2014); it proposes a gut-microbiome "
                    "mechanism but is primarily mouse-model data with a small human intervention "
                    "(n=7). No RCTs, Mendelian randomization studies, or Bradford Hill analyses "
                    "establishing causation in human populations were found. RCT meta-analyses "
                    "(Miller 2014, McGlynn 2022, Qin 2025) actively contradict the causal claim."
                ),
            },
        ],
        "sub_claim_results": [
            {
                "id": "SC1",
                "n_confirming": n_sc1,
                "threshold": CLAIM_FORMAL["sub_claims"][0]["threshold"],
                "holds": sc1_holds,
            },
            {
                "id": "SC2",
                "n_confirming": n_sc2,
                "threshold": CLAIM_FORMAL["sub_claims"][1]["threshold"],
                "holds": sc2_holds,
            },
        ],
        "adversarial_checks": adversarial_checks,
        "verdict": verdict,
        "key_results": {
            "sc1_n_confirming": n_sc1,
            "sc1_threshold": CLAIM_FORMAL["sub_claims"][0]["threshold"],
            "sc1_holds": sc1_holds,
            "sc2_n_confirming": n_sc2,
            "sc2_threshold": CLAIM_FORMAL["sub_claims"][1]["threshold"],
            "sc2_holds": sc2_holds,
            "n_holding": n_holding,
            "n_total": n_total,
            "claim_holds": claim_holds,
        },
        "generator": {
            "name": "proof-engine",
            "version": open(os.path.join(PROOF_ENGINE_ROOT, "VERSION")).read().strip(),
            "repo": "https://github.com/yaniv-golan/proof-engine",
            "generated_at": date.today().isoformat(),
        },
    }

    print("\n=== PROOF SUMMARY (JSON) ===")
    print(json.dumps(summary, indent=2, default=str))

↓ download proof.py · view on Zenodo (immutable)

Re-execute this proof

The verdict above is cached from when this proof was minted. To re-run the exact proof.py shown in "View proof source" and see the verdict recomputed live, launch it in your browser — no install required.

Re-execute the exact bytes deposited at Zenodo.

Re-execute in Binder runs in your browser · ~60s · no install

First run takes longer while Binder builds the container image; subsequent runs are cached.

machine-readable formats

Jupyter Notebook interactive re-verification W3C PROV-JSON provenance trace RO-Crate 1.1 research object package
Downloads & raw data

found this useful? ★ star on github