I am using a simple php script to estimate the number of online users by counting the number of apache session files in my tmp directory. Every now and then the # of users spikes up and remains inflated for several days before returning to normal. I checked today during another inflated period, and it looks like the bulk of the sessions are coming from google’s crawler (crawl-66-249-71-56.googlebot.com).
So my questions:
- Is there a reason there are over 100 sessions coming from the google crawler?
- Why does this happen periodically over a span of several days and then go away?
- What can I do to filter this information?
Thanks.