# By default we allow robots to access all areas of our site accessible to anonymous users, except for search, which burns our CPU for no reason. User-agent: * Disallow: /search Disallow: /cidd Disallow: /cidd/ Disallow: /cidd/* Disallow: webcam.php Disallow: /webcam.php Disallow: /help Disallow: /help/ Disallow: /help/* Disallow: /testtalks Disallow: /testtalks/ Disallow: /testtalks/* Disallow: /mscwebcam Disallow: /mscwebcam/ Disallow: /mscwebcam/* Disallow: /clusterhire Disallow: /clusterhire/ Disallow: /clusterhire/* # Add Googlebot-specific syntax extension to exclude forms # that are repeated for each piece of content in the site # the wildcard is only supported by Googlebot # http://www.google.com/support/webmasters/bin/answer.py?answer=40367&ctx;=sibling User-Agent: Googlebot Disallow: /*sendto_form$ Disallow: /*folder_factories$ Disallow: /*?searchterm=* # Penn State's Google Search Appliance comes at some servers so hard and fast that it burns 60% of their CPU. Limit what it spiders: User-Agent: PennStateSpider Disallow: /*sendto_form$ Disallow: /*folder_factories$ Disallow: /*?searchterm=*