Hi,
I've been struggling with this problem for days. I had the crawler set up properly and started working on a front-end to replace the search page (I need security). After working on that for a few days, I noticed that the number of results returned from our intranet kept shrinking.
The web crawler seems to have stopped accepting cookies from the application (phpBB). I checked the log, and the only cookie sent with a page request is
Cookie: ASPSESSIONIDSSRDQDCB=FNJJKINDLMBLG
followed by basic authentication.
I use two levels of security for our site: HTTP basic authentication (which is working fine) and phpBB authentication to view the forums. I found that by setting both up to crawl with two different URLs (e.g. website.com and website.com/phpbb), the crawler applies both authentication methods to everything: every request has a basic Authorization header and, until this broke, also carried the phpBB cookies.
I replayed the headers from the log, and the server is indeed sending several Set-Cookie headers in its responses (the log shows them too), but none of the subsequent requests send the cookie data back to the server.
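In case it helps anyone reproduce the check, here is a rough sketch of how the comparison could be done from the logged headers, using only Python's standard library. The phpBB cookie names and values below are made up for illustration (they are not copied from my log), and the helper names are my own. It ignores domain, path and expiry matching, so it only tells you which cookie names were set but never sent back.

```python
from http.cookies import SimpleCookie


def cookie_names_set(set_cookie_headers):
    """Collect cookie names from a list of Set-Cookie header values."""
    names = set()
    for value in set_cookie_headers:
        parsed = SimpleCookie()
        parsed.load(value)  # attributes such as path are not treated as names
        names.update(parsed.keys())
    return names


def cookie_names_sent(cookie_header):
    """Collect cookie names from the value of a request's Cookie header."""
    parsed = SimpleCookie()
    parsed.load(cookie_header)
    return set(parsed.keys())


def missing_cookies(set_cookie_headers, cookie_header):
    """Names the server set that the next request did not send back."""
    return sorted(cookie_names_set(set_cookie_headers)
                  - cookie_names_sent(cookie_header))


if __name__ == "__main__":
    # Hypothetical Set-Cookie values standing in for what phpBB returns.
    server_set = [
        "phpbb2mysql_sid=abc123; path=/",
        "phpbb2mysql_data=xyz; path=/",
    ]
    # The Cookie header value the crawler actually sent, per the log.
    crawler_sent = "ASPSESSIONIDSSRDQDCB=FNJJKINDLMBLG"
    print(missing_cookies(server_set, crawler_sent))
```

If the list comes back non-empty for requests under the forum path, the crawler is dropping those cookies rather than the server failing to set them.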
Is there a way to clear the crawler's cookies? Is this a bug or a configuration problem? Any insight at all? I've already cleared the index to reset the spider configuration. It was working at one point.