PEER-REVIEWED · mSYSTEMS (ASM) · OPEN ACCESS (CC BY 4.0)

The science behind precision synbiotics for autism.

A detailed breakdown of the largest published study on personalized probiotics and the autism-spectrum microbiome. 296 participants. Machine-learning-driven formulation. Published outcomes with full transparency.

Phan J, Calvo DC, Nair D, Jain S, et al.Arizona State University + Sun Genomics (Flore Inc.)
PublishedmSystems, April 25, 2024
In this page
  1. Study overview
  2. Study design and cohort
  3. Sequencing and bioinformatics
  4. Machine learning classifier
  5. Baseline microbiome differences
  6. Precision formulation pipeline
  7. Microbiome outcomes post-intervention
  8. GI symptom outcomes (GSRS)
  9. Behavioral and developmental outcomes
  10. Metabolic pathway analysis
  11. Correlations
  12. Safety and tolerability
  13. Limitations — in full
  14. Full citation and disclosures

1. Study overview

The study — “Precision synbiotics increase gut microbiome diversity and improve gastrointestinal symptoms in a pilot open-label study for autism spectrum disorder” — was published in mSystems (American Society for Microbiology) on April 25, 2024. It is the largest published investigation of personalized probiotic + prebiotic (synbiotic) intervention designed specifically for autistic individuals.

296
ASD participants enrolled
123
Neurotypical controls
170
Completed full protocol
3 mo
Supplementation period

The core hypothesis: autistic individuals harbor a measurably distinct microbiome signature that can be identified via machine learning, and that personalized synbiotics (custom probiotics + matched prebiotics) formulated from that signature can increase microbial diversity and reduce GI symptoms.

Why this study matters: Most probiotic studies use a single, one-size-fits-all formula. This study formulated a unique blend for each participant — 4 to 8 probiotic strains plus 1 to 2 prebiotics, selected from over 100 scientifically studied ingredients based on each person's individual microbiome data.

2. Study design and cohort

This was a pilot open-label study — participants knew they were receiving the intervention. There was no placebo arm. The neurotypical controls (n=123) provided baseline microbiome samples for the ML classifier but did not receive the intervention.

IRB approval: Arizona State University, STUDY00012299. Registered on ClinicalTrials.gov before enrollment began.

Enrollment criteria

Cohort demographics

CharacteristicASD group (n=296)Neurotypical (n=123)
Mean age10.41 yrs (SD 7.14)10.74 yrs (SD 8.71)
Age range2.5–75 yearsAge-matched
Male79.7%52.0%
Diet quality (below avg or poor)~60%Not collected
Average diet22.6%Not collected
Excellent or very good diet17.5%Not collected
GFCF diet~33%Not collected
Vegetarian5%Not collected
Taking additional supplements~65%Not collected
Geography (USA)89%96%

Retention: 170 of 296 (57.4%) completed the full protocol. Dropout reasons among the 126 who left: lack of perceived benefit (34%), price (21%), customer service issues (3 cases). On average, 5–6 months elapsed between the baseline sample and the second timepoint sample receipt and processing.

Note on gender imbalance: The ASD group was 79.7% male vs. 52.0% in controls. The authors note this is a limitation. However, an initial analysis found no significant differences in microbial alpha or beta diversity between sexes in either cohort in this dataset.

3. Sequencing and bioinformatics

This study did not use standard 16S rRNA amplicon sequencing with a public pipeline. Instead, it used a proprietary metagenomic approach:

What this means: The resolution is significantly higher than typical 16S studies. Instead of looking at one gene region, the sequencing captures the full genetic content, enabling species-level (and sometimes strain-level) identification across ~23,000 reference organisms. This is the same technology used to build each participant's personalized formula.

4. Machine learning classifier

The research team built a stochastic gradient boosting classifier to distinguish ASD microbiomes from neurotypical microbiomes using the baseline sequencing data.

Model specification

ParameterValue
AlgorithmStochastic gradient boosting
Training / test split75% / 25%
Interaction depth tested1, 5, 9
Number of trees tested50–1,500 (step 50)
Shrinkage0.1
Optimal interaction depth9
Optimal tree count1,350

Results

0.95
ROC AUC (training)
0.90
Sensitivity (training)
21.8%
Validation accuracy

Important caveat: The training-set performance (AUC 0.95, sensitivity 0.90) looks impressive, but the validation accuracy was only 21.8%. The authors describe this as a “low level of predictive accuracy” on held-out data. This means the model can detect patterns in the training data but does not generalize reliably to new samples for predicting ASD status from microbiome data alone. The model's value is in identifying which taxa differ — not as a diagnostic tool.

A separate Random Forest analysis on the baseline cohort comparison had an out-of-bag error rate of 17.54%. From the top 50 features, 28 taxa reached significance after FDR correction and are described in the next section.

5. Baseline microbiome differences

The Random Forest analysis identified 28 taxa significantly different between ASD and neurotypical cohorts after false discovery rate (FDR) correction:

Elevated in ASD

The following genera/species were found at significantly greater proportions in the ASD cohort:

Reduced in ASD (higher in controls)

These taxa were at significantly larger proportions in the neurotypical cohort:

Baseline diversity

MetricASD (n=296)Neurotypical (n=123)Significance
Shannon index3.90 (SD 0.48)4.04 (SD 0.38)Significantly lower in ASD
RichnessLowerHigherSignificantly lower in ASD
EvennessLowerHigherSignificantly lower in ASD

Beta diversity (baseline)

Figure 3: Differential proportion of microbes between ASD and neurotypical cohorts at baseline
Bar chart showing differential proportions of microbial taxa between ASD and neurotypical cohorts at baseline. Taxa like Faecalibacterium, Ruminococcus, and Fusicatenibacter are depleted in ASD; Clostridium, Klebsiella, and Shigella are elevated.
Taxa to the right are more abundant in ASD; taxa to the left are more abundant in neurotypical controls. 28 taxa reached significance after FDR correction.
Source: Phan et al., mSystems 2024, Fig. 3. CC BY 4.0.

In plain language: Autistic individuals had measurably lower gut diversity at baseline. Their microbiomes were missing key beneficial organisms (Faecalibacterium, Ruminococcus, Fusicatenibacter) and had elevated levels of potentially problematic genera (Clostridium, Klebsiella, Shigella). This is the signature the personalized formulations were designed to address.

6. Precision formulation pipeline

This is where the study departs from every other probiotic trial. Instead of giving everyone the same capsule, the team used a multi-step personalized pipeline:

Step 1: Metagenomic profiling

Each participant submitted a stool sample analyzed via Illumina NextSeq 550 sequencing (150 bp paired-end). Reads were aligned against the ~23,000 species reference database to produce a detailed species-level profile.

Step 2: Gap analysis

Each individual's profile was compared against the neurotypical reference panel and an internal database of healthy microbiome distributions, informed by the ML classifier's feature importance rankings. This identified which beneficial taxa were depleted and which potentially harmful taxa were elevated for that specific person.

Step 3: Custom synbiotic blend

Post-intervention taxa that increased

After 3 months, participants showed significant increases in:

Figure 6: Longitudinal microbiome trends — taxa that shifted after supplementation
Charts showing specific taxa that increased or decreased after 3 months of personalized synbiotic supplementation, including Bacillus subtilis, Bifidobacterium breve, and Lactobacillus species converging toward neurotypical levels.
Post-supplementation, key taxa like B. subtilis and Pseudoflavonifractor converged with neurotypical levels. Several Lactobacillus and Bifidobacterium species increased as expected from probiotic supplementation.
Source: Phan et al., mSystems 2024, Fig. 6. CC BY 4.0.

Manufacturing: All formulations produced under cGMP conditions by Flore Inc. at their facility in Joliet, IL. Each batch quality-tested for strain identity, viability (CFU count), and contaminant absence before shipping.

7. Microbiome outcomes post-intervention

The primary microbiome endpoints were changes in alpha diversity from baseline (T1) to 3-month post-supplementation (T2) among the 170 completers.

MetricWhat it measuresChange T1 → T2Significance
Shannon indexRichness + evenness combinedSignificant increasep < 0.05
Observed richnessTotal number of unique taxaSignificant increasep < 0.05
Pielou's evennessHow evenly taxa are distributedSignificant increasep < 0.05
Figure 2: Diversity comparisons — ASD vs. neurotypical, and longitudinal ASD changes
Box plots and PCoA ordination showing Shannon index, richness, and evenness comparisons between ASD baseline, ASD post-supplementation, and neurotypical controls. ASD diversity increases post-supplementation and converges with neurotypical levels.
Left panels: alpha diversity (Shannon, richness, evenness) at baseline and post-supplementation. Right panels: PCoA ordination showing community-level shifts. After supplementation, ASD diversity was no longer significantly different from neurotypical controls.
Source: Phan et al., mSystems 2024, Fig. 2. CC BY 4.0.

Key result: After supplementation, the ASD cohort's alpha diversity metrics were no longer significantly different from the neurotypical controls. The gut ecosystems became richer, more balanced, and more diverse — converging toward the healthy control profile.

Beta diversity (post-intervention)

ComparisonPCO1 (p.adj)PCO2 (p.adj)
ASD T1 vs. NT T10.0062.43 × 10-9
ASD T2 vs. NT T10.01652.62 × 10-8
ASD T1 vs. ASD T2No significant difference

The overall community composition (beta diversity) showed a shift toward the neurotypical cluster on PCO axes, though the ASD T1 vs. T2 comparison did not reach significance — meaning the shift was directional but modest at the whole-community level.

8. GI symptom outcomes (GSRS)

GI symptoms were measured using the Gastrointestinal Symptom Rating Scale (GSRS), a validated clinical instrument with five subscales scored 1–7 (1 = no discomfort, 7 = very severe).

Baseline GSRS scores

SubscaleBaseline mean (SD)Interpretation
Overall composite2.25 (0.97)Slight to mild discomfort
Abdominal pain3.37 (1.26)Mild to moderate discomfort
Constipation2.69 (1.61)Slight to mild
Indigestion2.39 (1.80)Slight to mild
Diarrhea2.01 (1.22)Slight
Reflux1.69 (1.16)No to slight discomfort

Post-supplementation

The composite GSRS score significantly decreased from T1 to T2 (Wilcoxon test, p < 0.05), indicating a reduction in GI symptom severity. Per the PGIA GI-specific question, 52% of participants reported GI improvement.

This is meaningful because GI symptoms affect up to 90% of autistic individuals and are correlated with behavioral challenges, sleep disruption, and reduced quality of life.

9. Behavioral and developmental outcomes

Multiple validated instruments were used. Here's what each one found:

PGIA (Parent Global Impressions of Autism)

A parent-reported measure where families rate change from baseline on a scale of −3 (much worse) to +3 (much better).

PGIA: Overall change (n=170 completers)
Some improvement
62%
No change
30%
Some worsening
6%
Overall average PGIA score at T2: 0.36 ± 0.55 (SD). Among those who improved: 0.69 ± 0.54. Source: Phan et al., mSystems 2024.

PGIA by domain (≥50% reported improvement)

Domain% Reporting improvement
Receptive language and comprehension≥50%
Expressive language and speech≥50%
Cognition and thinking≥50%
Gastrointestinal problems52%

Critical transparency note: The overall PGIA score of 0.36 ± 0.55 is within the placebo range. The authors themselves flag this: a 2013 placebo-controlled study by Adams et al. found a placebo group PGIA of 0.34 ± 0.54 — nearly identical. This means the behavioral improvements reported here cannot be distinguished from placebo effect based on this study design alone. The PGIA is a subjective parent-reported measure, and parents who enrolled (and stayed) are likely motivated, which can bias reporting.

SRS-2 (Social Responsiveness Scale, 2nd edition)

Baseline mean: 80.08 (SD 10.36). Distribution: 29% moderate, 6% severe, 6% mild-to-moderate.

Result: No significant change post-supplementation.

SCARED (Screen for Child Anxiety Related Disorders)

SubscaleBaseline mean (SD)% Meeting threshold
Overall19.34 (15.46)26.1% may have anxiety disorder
Separation anxiety4.42 (3.74)42.0%
Social anxiety5.49 (4.12)29.3%
School avoidance1.52 (1.87)27.4%
Generalized anxiety4.04 (4.44)19.1%
Panic disorder19.34 (15.46)17.8%

Result: No significant change post-supplementation.

Figure 7: Longitudinal survey assessments — GSRS, PGIA, SRS-2, and SCARED
Box plots showing longitudinal changes in GSRS (significant decrease), PGIA (62% improvement), SRS-2 (no significant change), and SCARED (no significant change) from baseline to post-supplementation.
GSRS showed significant improvement (GI symptoms decreased). PGIA showed 62% parent-reported improvement but scores are within placebo range. SRS-2 and SCARED showed no significant changes.
Source: Phan et al., mSystems 2024, Fig. 7. CC BY 4.0.

Summary of behavioral measures: The PGIA (subjective, parent-reported) showed 62% reporting improvement, but the score falls within placebo range. The SRS-2 and SCARED (standardized instruments) showed no significant changes. The honest interpretation: the strongest evidence from this study is in microbiome diversity and GI symptoms, not behavioral outcomes.

10. Metabolic pathway analysis

Using HUMAnN2 functional profiling, the study identified significant differences in metabolic pathways between ASD and neurotypical microbiomes at baseline:

Elevated in ASD microbiomes

Reduced in ASD microbiomes

Post-supplementation pathway changes

Several pathways significantly decreased after intervention, converging toward neurotypical levels:

Figure 4: Differential metabolic pathways between ASD and neurotypical cohorts
Extended error bar plot showing metabolic pathways significantly different between ASD and neurotypical microbiomes, including LPS biosynthesis, polymyxin resistance, and GABA degradation pathways.
Pathways to the right are more abundant in ASD; pathways to the left are more abundant in neurotypical controls. Note elevated LPS biosynthesis and polymyxin resistance in ASD, and reduced GABA degradation.
Source: Phan et al., mSystems 2024, Fig. 4. CC BY 4.0.

Gene family analysis

A Random Forest on gene families had an error rate of 15.35%. Of the top 50 discriminating gene families: 6 were higher in ASD, 44 were lower. Most of the depleted gene families were annotated from Ruminococcus spp., Fusicatenibacter saccharivorans, and Faecalibacterium prausnitzii — key butyrate producers and gut-health organisms.

Figure 5: Differential gene families between ASD and neurotypical cohorts
Extended error bar plot showing gene families significantly different between ASD and neurotypical microbiomes. 44 of 50 top features were lower in ASD, mostly from Ruminococcus, Fusicatenibacter, and Faecalibacterium.
44 of the top 50 discriminating gene families were depleted in ASD. Most originated from butyrate-producing species critical for gut barrier integrity.
Source: Phan et al., mSystems 2024, Fig. 5. CC BY 4.0.

11. Correlations

The study tested several correlations between microbiome metrics, behavior, and nutrition:

CorrelationPearson rp-valueDirection
Nutritional assessment ↔ PGIA−0.190.025Worse nutrition → worse ASD severity
Fruit servings ↔ Richness0.200.031More fruit → higher microbiome richness
Microbial evenness ↔ SCARED total−0.170.045Lower evenness → higher anxiety

Non-significant: Shannon, richness, and evenness were not significantly correlated with overall nutritional assessment, PGIA at baseline, SRS-2, or GSRS.

Subgroup analyses

Figure 1: Microbes associated with ASD subpopulations
Charts showing microbial differences within ASD subgroups based on SRS-2 severity, diet type, and other characteristics.
Microbial differences within the ASD cohort based on social responsiveness severity (SRS-2) and dietary patterns. Prevotella, Bacteroides, and Fusicatenibacter distinguished severe from non-severe groups.
Source: Phan et al., mSystems 2024, Fig. 1. CC BY 4.0.

12. Safety and tolerability

13. Limitations — in full

The authors themselves list these limitations. We believe in showing the full picture:

  1. Open-label, no placebo arm. Participants knew they were receiving the intervention. Placebo effect and expectation bias cannot be ruled out — especially for subjective measures like PGIA.
  2. PGIA scores are within placebo range. The overall PGIA improvement (0.36 ± 0.55) is nearly identical to the placebo group in Adams et al. 2013 (0.34 ± 0.54). The authors explicitly state: “Survey to measure improvements is comparable to a placebo effect.”
  3. SRS-2 and SCARED showed no significant change. Standardized behavioral instruments did not detect improvement — only the parent-reported PGIA did.
  4. Gender imbalance. ASD cohort was 79.7% male vs. 52.0% in controls. While no diversity differences by sex were found in this dataset, evidence from other studies suggests gender may influence the ASD microbiome.
  5. No control data on diet, GI, or behavior. The neurotypical controls provided microbiome samples only. No dietary, GI, or behavioral data was collected from controls, limiting comparison.
  6. 42.6% dropout rate. Only 170 of 296 completed the full protocol. Completers may systematically differ from dropouts — particularly since 34% dropped out for “lack of perceived benefit,” which biases the remaining sample toward those who felt it was working.
  7. Each participant got a different formula. This is inherent to the personalized approach but makes it impossible to attribute outcomes to any specific strain or combination.
  8. ML classifier doesn't generalize. The stochastic gradient boosting model achieved AUC 0.95 on training data but only 21.8% accuracy on validation — it overfits and cannot be used as a diagnostic tool.
  9. No long-term follow-up. Whether benefits persist after stopping supplementation is unknown.
  10. Not a randomized controlled trial (RCT). The gold standard for causal claims. The authors describe this as a pilot study. A proper RCT would be needed to establish efficacy.

Our position: This is meaningful pilot data. The microbiome diversity gains and GI symptom improvements are the strongest findings. The behavioral data is promising but cannot be separated from placebo effect given the study design. We don't claim to cure anything. We do claim this is the most evidence-backed personalized probiotic formulation available for families navigating autism — and that the science warrants continued investigation, ideally through a randomized controlled trial.

14. Full citation and disclosures

Phan J, Calvo DC, Nair D, Jain S, Montagne T, Dietsche S, Blanchard K, Treadwell S, Adams J, Krajmalnik-Brown R. “Precision synbiotics increase gut microbiome diversity and improve gastrointestinal symptoms in a pilot open-label study for autism spectrum disorder.” mSystems. 2024;9(5):e00503-24.

DOI: 10.1128/msystems.00503-24

PubMed Central: PMC11097633

License: Open access (CC BY 4.0)

Conflicts of interest

Transparency matters. Here are the declared conflicts:

The study was conducted through IRB collaboration between Sun Genomics, Arizona State University, and the Biodesign Center.

You read the whole thing. That says something.

Personalized probiotics backed by real, transparent science. Not perfect evidence — pilot evidence. But the best available for families navigating autism.

Get started