Which score is right?

I think this issue was addressed in the most recent dev Q&A:

At least, I think that’s the scoring issue they were talking about. The TL;DR: they know it’s an issue, but fixing it requires big infrastructure changes that they don’t have an ETA for. We just have to live with it.