I just checked the data and I think my score comes from the fact that there around 80% sus overall. That is to say, the model is not as good as I thought. It predicts sus=0 all the time. But since the goal, at least stated in the docs, is to find out the 'evil' data, is it appropriate to stick to models trained with the sus given the fact that all the 'evil' values are 0?