# /robots.txt file for http://www.dotlove.com/ # mail webmaster@dotlove.com for constructive criticism # # Robots exclusion standard # see: http://www.robotstxt.org/ # http://www.searchengineworld.com/robots/robots_tutorial.htm # # # The format and semantics of the "/robots.txt" file are as follows: # The file consists of one or more records separated by one or more blank lines # (terminated by CR,CR/NL, or NL). Each record contains lines of the form # ":". The field name is case # insensitive. # # Comments can be included in file using UNIX bourne shell conventions: # the '#' character is used to indicate that preceding space (if any) and the # remainder of the line up to the line termination is discarded. Lines # containing only a comment are discarded completely, and therefore do not # indicate a record boundary. # # The record starts with one or more User-agent lines, followed by one or more # Disallow lines, as detailed below. Unrecognised headers are ignored. # # The presence of an empty "/robots.txt" file has no explicit associated # semantics, it will be treated as if it was not present, i.e. all robots # will consider themselves welcome. # # User-agent # The value of this field is the name of the robot the record is # describing access policy for. # If more than one User-agent field is present the record describes an # identical access policy for more than one robot. At least one field # needs to be present per record. # # The robot should be liberal in interpreting this field. A case # insensitive substring match of the name without version information # is recommended. # # If the value is '*', the record describes the default access policy # for any robot that has not matched any of the other records. It is not # allowed to have multiple such records in the "/robots.txt" file. User-agent: * # Disallow # The value of this field specifies a partial URL that is not to be # visited. This can be a full path, or a partial path; any URL that # starts with this value will not be retrieved. For example, # Disallow: /help disallows both /help.html and /help/index.html, # whereas Disallow: /help/ would disallow /help/index.html but allow # /help.html. # Any empty value, indicates that all URLs can be retrieved. At least # one Disallow field needs to be present in a record. Disallow: / #Disallow: /cgi-sys #Disallow: /cgi-bin #Disallow: /images # this appeared in google somehwere, dont recall how thou # does not seem to be part of the standard spec Allow: /*.html$