I think this was mentioned before, but it would be better if tpt used more cores. It would really boost the performance if it used more CPU cores, since the load can also be more evenly distributed across the cores and it can use more potential power to help in laggier saves. But it'd probably take 12.3201 months to implement this.
I'm sorry I made you sad. :(