1. Yes, the raw logs show everything that goes on on your site, as far as they've been configured to do. When using frames, the browser downloads each frame individually, so you'll have a call for both index.htm and header.htm.
See, that's what I don't get. There often is no log entry for index.htm, or the other way around (there's a log entry for index.htm, but not header.htm).
2. I'm not quite sure. As far as I believe, it should mean that the file was never downloaded. And, I don't think Yahoo would have used up 20 gigs. I don't think that indexing your entire forums would have taken up that much bandwidth. And Yahoo wouldn't download your .rar files. It would just make note of them. (If it did, Yahoo would suffer major problems trying to index file-hosting sites!)
I don't understand how it used 20 gigs then. (note: I've read before that Yahoo is a big bandwidth hog... I'm just trying to figure out HOW, as I didn't see it downloading anything big on my site --- the reason I wonder is to figure out if and how my raw logs are working. I don't actually care what the bots are doing: I just want everything to be recorded).
4. To quickly get you started on what each field is:
Thanks a lot for your help.