First, here is a StataCorp faq on what they do for cluster-adjusted robust SE's: h
ttp://www.stata.com/support/faqs/statistics/references/
Secondly, esp. re: small number of clusters, I recommend the following article: Cameron, AC and Miller, DL (2011), "Robust inference with clustered data," in A Ullah and DE Giles, _Handbook of Empirical Economics and Finance_, CRC Press, pp. 1-28; their basic answer is that the cluster-adjusted SE's are should not be used with a small number of clusters; they suggest a variation on clustered bootstrap for which there is some, but not full, code available on-line.
Richard Goldstein