There was a problem with the original simulation in this post, which is hopefully now fixed.
While the estimate of sample standard deviation tends to grow along with the numerator as the mean deviates from μ0, this turns out to not have all that big an effect on power at "typical" significance levels, because in medium to large samples, s∗/n−−√ still tends to be large enough to reject. In smaller samples it may be having some effect, though, and at very small significance levels this could become very important, because it will place an upper bound on the power that will be less than 1.
A second issue, possibly more important at 'common' significance levels, seems to be that the numerator and denominator of the test statistic are no longer independent at the null (the square of x¯−μ is correlated with the variance estimate).
This means the test no longer has a t-distribution under the null. It's not a fatal flaw, but it means you can't just use tables and get the significance level you want (as we will see in a minute). That is, the test becomes conservative and this impacts the power.
As n becomes large, this dependence becomes less of an issue (not least because you can invoke the CLT for the numerator and use Slutsky's theorem to say than there's an asymptotic normal distribution for the modified statistic).
Here's the power curve for an ordinary two sample t (purple curve, two tailed test) and for the test using the null value of μ0 in the calculation of s (blue dots, obtained via simulation, and using t-tables), as the population mean moves away from the hypothesized value, for n=10:
n=10
You can see the power curve is lower (it gets much worse at lower sample sizes), but much of that seems to be because the dependence between numerator and denominator has lowered the significance level. If you adjust the critical values appropriately, there would be little between them even at n=10.
And here's the power curve again, but now for n=30
n=30
This suggests that at non-small sample sizes there's not all that much between them, as long as you don't need to use very small significance levels.