Google has started crawling my site, but from a temporary domain (beta.mydomain instead of just mydomain) and also I only want him to crawl just some of my pages. Therefore, I want to stop their crawl and only let them crawl pages I specify in a sitemap. How can I do that? (I know how to add a sitemap, but how can I stop their current crawling and request that they’ll crawl just the sitemap)
Update: If I kill the subdomain beta.mydomain – will that be “fine” by them or will they continue go over all killed pages and “not like” them? Can I specify that in each page’s header?
You could start with a robots.txt file.
See google’s info here
I presume you have already looked at webmaster tools and sitemaps from what you say? Do be aware that while a sitemap will help tell google WHAT to crawl, it won’t work very well for telling them what NOT to crawl.
For that you will want to use the robots.txt file to block certain pages / folders.