Actually, a sample size of 10 can be good enough or not depending of the distributions of what you are measuring. If the difference between both samples is very obvious, 10 can be a sufficient size for it to be statistically significant.
You can use the website http://faculty.vassar.edu/lowry/VassarStats.html for significance testing, go to "t-tests and procedures" and to "two-sample t-test for independent correlated samples". There you would have to click "Independent samples", and in Sample A, write the PV's for your 10 dwarven wizzies and in Sample B for the elven ones. Then look at "Results", "P", "two-tailed". If the value of P is less than .01 (or .005, depending on how demanding you are); then your data are good enough and you can already conclude that there is in fact a difference between your two data sets. If the value is more than that, then either there is no difference, or there is a difference but you need to collect more data to get a significant test.