Primary biliary cholangitis, an autoimmune condition, results in the gradual deterioration of the small bile ducts within the liver. Despite its slow progression, it inevitably culminates in cirrhosis and liver decompensation.
In this instance, the data originates from the Mayo Clinic trial carried out between 1974 and 1984, involving 312 participants in a randomized trial. Upon conducting a basic logistic regression analysis to forecast treatment outcomes, we identified three statistically significant covariates exhibiting the most substantial impact:
The covariates
need to be normalized to have zero sample mean and unit sample variance. In this use case, we will work with a smaller version of the dataset, containing only
N=100 samples.
The solution is to be given by two binary arrays
group1 and
group2, each of size
n=100. For a patient
i,
group1[i]=1 and
group2[i]=0 indicates that patient
i is assigned to group 1. If a patient is assigned to both groups (
group1[i]=1,
group0[i]=1) or to none (
group1[i]=0,
group0[i]=0) , the solution will be marked as unfeasible.
Let's look at a simple example. Consider 6 patients, with the following covariates:
The discrepancy value for this solution to our toy problem is