Background
MASLD is a type of CLD defined by hepatic steatosis (>5 percent liver fat) in patients with at least one risk factor for cardiometabolic dysfunction (including obesity, hyperglycemia / type 2 diabetes mellitus (T2D), hypertension, and dyslipidemia). MASLD is commonly asymptomatic, and the natural history of the disease is usually fairly indolent with the majority of patients taking years or even decades to progress. The more severe form of disease, MASH, is diagnosed when histological evidence of inflammation and hepatocellular injury are present in addition to steatosis. As liver scarring accumulates, MASH may evolve to cirrhosis, which occurs more frequently in in those with T2D or prediabetes, morbid obesity, or more than one cardiometabolic risk factor.4,5 Advancing stages of fibrosis are associated with poor liver-related outcomes (e.g., hepatocellular carcinoma (HCC), liver decompensation, need for transplantation) as well as increased all-cause mortality.6 A meta-analysis of 1,495 subjects with 17,452 patient years of follow-up showed exponential increases in the liver-related mortality rate with each histologic stage of ?brosis.6 Therefore, accurate stratification for risk of fibrosis is one of the primary objectives in patient evaluation.
Mirroring the obesity epidemic, the burden of MASLD has been increasing worldwide, with a rise of nearly 50% in the global prevalence over the past three decades.7 Estimates of the prevalence of MASLD in the United States vary from 35 - 48%, with the highest rates among those of Hispanic origin.8 A recent study of patients with T2D in the US outpatient setting showed that ~70% have MASLD and approximately 15% have stage ≥2 fibrosis.9 Among North American patients with MASLD, ~30% of those without an indication for liver biopsy were reported to have MASH; the proportion with MASH rose to 60% in the presence of an indication for biopsy.5 MASH recently overtook Hepatitis C virus (HCV) as the leading etiology among US liver transplant patients with HCC, and remained second only to ALD in those without HCC.10 In 2021-2022, MASH was the primary listing diagnosis for 6,299 liver transplant patients, compared to 10,011 with ALD and only 1,793 with HCV.10
Historically, lifestyle modifications with diet and exercise have been the mainstay of therapeutic recommendations, with limited pharmacologic interventions available outside of the setting of T2D.11,12 However, in March of 2024, the Food and Drug Administration (FDA) approved resmetirom as the first pharmacotherapy for the treatment of adults with MASH with moderate to advanced (stages F2-F3) liver fibrosis on the basis of the randomized double-blind MAESTRO trial, which demonstrated significant improvement or resolution of liver scarring in patients receiving resmetirom compared to placebo.13 The FDA-approved drug label for resmetirom does not require liver biopsy to confirm advanced fibrosis.14 Additional agents are under active investigation.15
Evaluation of liver fibrosis
Liver biopsy has traditionally been considered the gold standard for evaluation of fibrosis, which is typically staged histologically on a scale ranging from F0 (no fibrosis) to F4 (cirrhosis). However, biopsy has several drawbacks. In addition to being an invasive procedure with the potential for associated complications such as pain and bleeding,16 the small size of the needle core relative to the liver can result in sampling error,17,18 and studies have demonstrated high inter-observer variability in pathologist reads.18 It is acknowledged by the provider community that liver biopsies are in fact not consistently performed in clinical practice.19, 20
The AASLD,19 AGA,21 AACE22 and American Diabetes Association (ADA)23 currently recommend a sequential approach to MASLD evaluation and risk assessment, with a primary rule-out assessment using FIB-4 to filter out patients at low risk for advanced fibrosis, followed by secondary assessment of the remaining intermediate- to high-risk patients with VCTE (an imaging-based assessment of fibrosis) or the Enhanced Liver Fibrosis (ELF) blood test. [The ELF test is also endorsed by the United Kingdom’s National Institute for Health and Care Excellence (NICE) for assessment for advanced liver fibrosis in people with MASLD.24] A repeat assessment cycle beginning with clinical evaluation and standard laboratory testing (e.g., FIB-4) may be considered every 1-2 years or 2-3 years depending on patient risk19,21 due to the slowly-progressive nature of the disease.25 The AASLD also issued a practice guidance update regarding selection of patients for prescription of resmetirom which favors the use of imaging but allows that blood-based testing may be utilized, and further notes that MAESTRO trial participants (all of whom had biopsy-proven F2-F3 fibrosis) had an ELF interquartile range of 9.2-10.4.26 While there are currently no defined thresholds by which to measure drug response, the AASLD recommends reassessment at 12 months with the same imaging- or blood-based modality that was used to determine treatment eligibility.26
The FIB-4 score, the first step in the guidelines’ evaluation cascade, is a calculation based on the patient’s age and readily available chemistry and hematology lab values according to the formula FIB-4 = (age × AST) / (platelet count × √ALT). In a comparison with seven other serum-based tests that utilize standard laboratory markers to predict F3-F4 fibrosis, the FIB-4’s area under the receiver-operating characteristic (AUROC) curve of 0.802 outperformed those of the NAFLD fibrosis score (NFS), Goteburg University Cirrhosis Index (GUCI), AST:ALT ratio, AST:platelet ratio index (APRI), AST:platelet ratio, body mass index (BMI) AST:ALT diabetes (BARD) score, and cirrhosis discriminant score (CDS), which ranged from 0.666 to 0.758.27 In a population of 541 confirmed MASLD patients (59% with MASH and advanced fibrosis prevalence of 23%), this study also showed that using FIB-4 values of ≤1.30 and ≥2.67 for the absence and presence of advanced fibrosis, respectively, had sensitivity of 0.74 with negative predictive value (NPV) of 90% and specificity of 0.98 with positive predictive value (PPV) of 80%.27 However, the proportion of FIB-4 scores falling in the indeterminate range has been reported to be as high as ~30%.28 Furthermore, due to the influence of age on the score, the FIB-4 test has been shown to have lower specificity in patients over 65 years old.29,30
After FIB-4, LSM is one option for the second step in the guidelines’ flowchart. VCTE, the most common technique by which to derive LSM, may be performed at the point of care but requires specialized equipment not typically found in primary care offices as well as experienced trained personnel that may not be available in all practice settings; >500 exams are generally needed to obtain competency.28 In addition, VCTE has a substantial failure rate (over 20%) in patients with severe (BMI ≥30) and morbid (BMI ≥40) obesity.31 Further confounding variables that affect VCTE accuracy include significant steatosis, inflammation, necrosis, cholestasis, venous congestion, and recent meal ingestion.31,32 Other imaging modalities, such as sound wave elastography (SWE) or magnetic resonance elastography (MRE), are sometimes offered but are overall less well characterized, and can similarly be limited by availability (particularly of bariatric MRE facilities) and high resource utilization.
The blood-based alternative to VCTE mentioned in guidelines as a follow-up to indeterminate/high FIB-4 is the ELF test, which is based on the combined quantitative measurements of hyaluronic acid, amino-terminal propeptide of type III procollagen, and tissue inhibitor of matrix metalloproteinase 1. The ELF test has FDA approval for use in MASH patients with advanced fibrosis (F3 or F4) to assess the likelihood of progression to cirrhosis and liver-related clinical events.33 Although not FDA-authorized for other indications, the ELF test has been extensively investigated for detecting advanced fibrosis in MASLD. However, the literature is complicated by the use of different formulas and thresholds. In a meta-analysis of 11 studies of the ELF test, the scoring systems were standardized and a multiple thresholds model was applied to generate a summary receiver-operating characteristic (SROC) curve, which showed an area under the curve (AUC) of 0.83 (95% CI 0.71–0.90) for detecting advanced fibrosis compared to biopsy.34 Using the guideline-recommended lower threshold of 7.7 and upper threshold of 9.8, the ELF test had a sensitivity of 0.93 for excluding fibrosis and specificity of 0.86 for advanced fibrosis; at a disease prevalence of 50%, NPV is 83% and PPV is 82%.34 Therefore, the authors suggested that the ELF test is better suited for higher-prevalence settings,34 such as would be seen in a pre-selected cohort with FIB-4 ≥1.3.
In a comparison of ELF to liver stiffness measurement (LSM) by FibroScan® (an FDA-approved VCTE system) for the prediction of liver fibrosis in patients with biopsy-confirmed MASLD, no significant difference was found in the AUROC curves (ELF=0.812 and LSM=0.839).35 A retrospective, cross-sectional study of 829 patients (462 with transient elastography data and 463 with liver biopsy data), showed that a significant increase in ELF score was correlated with advanced fibrosis.36 The AUROC for ELF for identifying fibrosis was 0.81 (95% CI, 0.77 to 0.85) with biopsy as the standard and 0.79 (95% CI, 0.75 to 0.82) with transient elastography as the reference.
Other currently available tests related to MASLD include NASH Fibro-Test / NASH FibroSure and FibroSpect, which originated for use in the setting of Hepatitis C, and LiverFASt. NASH Fibro-Test / FibroSure has been better studied for its ability to predict steatosis37 or steatohepatitis38 than fibrosis.39 In a single retrospective study, FibroSpect had a higher AUROC than FIB-4 and NFS, but the latter values were lower than expected based on other studies.40 Similarly, while data on LiverFASt have been presented at conferences, peer-reviewed publications are scarce.41
“At risk” MASH
In addition to the identification of patients with advanced fibrosis, an area of active interest is the detection of “at-risk” MASH, defined as steatohepatitis with an NAFLD activity score (a semi-quantitative measurement of histology injury that takes into account steatosis, hepatocyte ballooning and inflammation) of 4 or greater and fibrosis stage of at least 2. Current AASLD guidelines note that “serum and plasma-based lipidomic, metabolomic, and proteomic biomarkers are in development for ‘at-risk’ [M]ASH” but “have not reached the level of clinical evidence needed for use in routine clinical practice.”19
Another patented test with algorithm is the NIS4, which combines measurements of alpha-2 macroglobulin, miR-34a-5p, YKL-40 and hemoglobin A1c (HbA1c). In one study reporting results from two independent cohorts, the NIS4 was similar to VCTE (AUROCs 0.76 and 0.75, respectively) and better than ELF (AUROCs 0.83 and 0.77, respectively) and FIB-4 (AUROCs 0.80 and 0.74, respectively) at identifying at-risk MASH.42 The NIS2+, an optimization of the NIS4, removes alpha-2 macroglobulin and HbA1c from the NIS4 equation and adds sex-corrections (both independent and for miR-34a-5p), resulting in altered threshold values. The NIS2+ was statistically better than the NIS4 (AUROC 0.813 vs. 0.792, respectively) at categorizing at-risk MASH in patients being vetted for the RESOLVE-IT trial for the safety and efficacy of elafibranor treatment.43 In a study dedicated to demonstrating performance in the Medicare-aged population, NIS4 and NIS2+ had similar results for the detection of at-risk MASH in subjects <65 years (.084 sensitivity for both at the low cutoffs, and specificities of 0.844 and 0.855 respectively at the high cutoffs) while NIS2+ had better specificity in those >65 years (0.677 vs. 0.811 at high thresholds, respectively); both tests outperformed FIB-4, NFS and ELF for this indication.44 The AGA states that the NIS2+ is validated for the detection of at-risk MASH.21
Another test developed for flagging at-risk MASH is the metabolomics advanced steatohepatitis fibrosis score (MASEF), which is based on 12 lipids, BMI, AST, and ALT. In a validation cohort of over 500 patients, the MASEF score had an AUROC of 0.79 (95% CI 0.75–0.83) for predicting at-risk MASH, with a sensitivity of 78%, specificity of 65%, PPV of 48%, and NPV of 88%. In addition, there was a non-significant difference between FIB-4 + MASEF and FIB-4 + LSM by VCTE.45
The Foundation for the National Institutes of Health (FNIH) - Non-invasive Biomarkers of Metabolic Liver Disease (NIMBLE) consortium is a public-private partnership collaboration which aims to support regulatory approval of MASH-related biomarkers. In stage one of their efforts, FNIH NIMBLE prespecified successful performance metrics as an AUROC ≥0.7, superiority over ALT for disease activity, and superiority over the FIB-4 test for fibrosis severity. The fully blood-based panels prioritized for initial study were NIS4, OWLiver (which includes MASEF), ELF and PROC3. In an observational cohort of over 1000 patients representing the full spectrum of MASLD, the ELF and NIS4 tests met AUROC requirements; NIS4 outperformed ALT for activity score and ELF outperformed FIB-4 for all fibrosis endpoints when using the Youden cutpoint as well as when constraining to 90% sensitivity and 90% specificity.46 OWLiver could not be fully evaluated due to the categorical nature of the results. In the next stage of the project, these tests will be advanced for further testing in patients with clinical risk factors in more specific intended use populations. The authors also note that this overarching framework can be applied to other promising tests in development, including combinatorial approaches that integrate imaging and blood measurements.46