It's a little unfair, yeah. But if I understand correctly, the reason they didn't include English speech is because it's much more difficult to do correctly -- English has too many nuances and contradictions in its spelling rules. They would either need to devote resources to making their own speech algorithm or pay to use one that's already been developed. Either way, it would increase the cost.
2そうだね プレイ済み