Description
The Bing bot generates a lot of page directories on disk for Calendar pages while scanning my wiki (ver. 1.9.4).
Steps to reproduce
- look in the web server log, e.g. /var/log/apache2/access.log
Example
Apache2 log: 65.52.104.87 - - [17/Jul/2012:18:13:54 +0200] "GET /wiki9/HelpOnMacros/MonthCalendar/2007-08-22?action=edit&template=MonthCalendarTemplate HTTP/1.1" 404 1938 "-" "Mozilla/5.0 (compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm)" added page, e.g.: HelpOnMacros(2f)MonthCalendar(2f)1998(2d)02(2d)09
Component selection
- general
Details
MoinMoin Version |
1.9.4 |
OS and Version |
Ubuntu 10.04 |
Python Version |
|
Server Setup |
|
Server Details |
|
Language you are using the wiki in (set in the browser/UserPreferences) |
|
Workaround
Add to MoinMoin/config/multiconfig.py "bingbot"
('ua_spiders', ('archiver|bingbot|cfetch|charlotte|crawler|gigabot|googlebot|heritrix|holmes|htdig|httrack|httpunit|' 'intelix|jeeves|larbin|leech|libwww-perl|linkbot|linkmap|linkwalk|litefinder|mercator|' 'microsoft.url.control|mirror| mj12bot|msnbot|msrbot|neomo|nutbot|omniexplorer|puf|robot|scooter|seekbot|' 'sherlock|slurp|sitecheck|snoopy|spider|teleport|twiceler|voilabot|voyager|webreaper|wget|yeti'),
Discussion
This is an issue in general: if one starts editing, a pagedir / edit-log will get created (even if one never saves).
Plan
- Priority:
- Assigned to:
- Status: