Web Hosting Forum | Lunarpages
Author Topic: Web listings I don't want  (Read 1365 times)
Parity
Space Explorer
***
Offline Offline

Posts: 7


« on: April 19, 2006, 09:03:54 AM »

Strangely, Google is not only listing my Web site but also listing my index file, forum, statistics about my forum, etc. All I want is my Web site. How can I get rid of all these extra listings, which are nobody's business and just clutter up the page?
Logged
TranzNDance
Princess of Naboo
Berserker Poster
*****
Offline Offline

Posts: 11809



WWW
« Reply #1 on: April 19, 2006, 09:15:03 AM »

Do you have an example listing? I'm not sure what you want. Your index file is part of your web site. There is code you can put in meta tags or in robots.txt to tell robots how to behave, but you need to be more specific about which pages you want left out.
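For anyone wondering what the meta tag route looks like: the usual form is a `<meta name="robots" content="noindex">` tag in the page's `<head>`. Below is a minimal Python sketch, not a definitive tool, that reads a page's HTML and reports which robots directives it carries. The sample page and the names `RobotsMetaParser` and `robots_directives` are made up for illustration.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects the directives found in <meta name="robots"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # attrs is a list of (name, value) pairs; value may be None.
        attr_map = {name: (value or "") for name, value in attrs}
        if attr_map.get("name", "").lower() == "robots":
            content = attr_map.get("content", "")
            for directive in content.split(","):
                directive = directive.strip().lower()
                if directive:
                    self.directives.append(directive)


def robots_directives(html):
    """Return the robots directives declared in the given HTML text."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


# Hypothetical page that asks robots not to index it or follow its links.
PAGE = """<html><head>
<title>Statistics Center</title>
<meta name="robots" content="noindex, nofollow">
</head><body></body></html>"""

if __name__ == "__main__":
    print(robots_directives(PAGE))
```

A page with no robots meta tag gives an empty list, which means robots fall back to their default behaviour.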

I don't see how it's strange. A lot of website owners wish more of their web sites were indexed by the search engine.
Logged

Parity
Space Explorer
***
Offline Offline

Posts: 7


« Reply #2 on: April 19, 2006, 10:25:35 AM »

Here is an example of what a Google search is bringing up that I don't want. If you respond to the "more results" invitation, you get even more unwanted, irrelevant listings:
Parity for Peace - Statistics Center
Parity for Peace - Statistics Center. ... + Parity for Peace |- + Statistics Center. Parity for Peace - Statistics Center. General Statistics ...
www.parityforpeace.org/forum/index.php?PHPSESSID=5d7faa0ee2bac7338f096ad481cf18b4&action=stats - 24k - Cached - Similar pages

    Login
    Warning! You can't view the memberlist because you don't have permission to. Please login below or register an account with Parity for Peace. ...
    www.parityforpeace.org/forum/index.php?PHPSESSID=b6ab81f2c82e696b47b34ddd81814b56&action=mlist - 13k - Cached - Similar pages
    [ More results from www.parityforpeace.org ]
Logged
Martijn
Intergalactic Superstar
*****
Offline Offline

Posts: 180


WWW
« Reply #3 on: April 19, 2006, 11:13:23 AM »

If you don't want your forum to be indexed, create a file called robots.txt, and place it in your public_html directory. Add the following to this file to stop search engines from storing your forum contents:

User-agent: *
Disallow: /forum/
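
If you want to double-check what these two lines block before waiting for a crawl, here is a small Python sketch using the standard library's `urllib.robotparser`. It only tests the rules locally; it says nothing about when a search engine will act on them. The helper name `is_allowed` and the `index.html` URL are assumptions for illustration; the forum URL is the one quoted earlier in the thread.

```python
from urllib.robotparser import RobotFileParser

# The same rules as in the robots.txt above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /forum/
"""


def is_allowed(url, user_agent="*"):
    """Return True if the rules above let user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Hypothetical front page: not under /forum/, so it stays crawlable.
    print(is_allowed("http://www.parityforpeace.org/index.html"))
    # Forum page: under /forum/, so well-behaved robots should skip it.
    print(is_allowed("http://www.parityforpeace.org/forum/index.php?action=stats"))
```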

Logged
Parity
Space Explorer
***
Offline Offline

Posts: 7


« Reply #4 on: April 19, 2006, 11:31:41 AM »

Thanks so much for this information! I'll try it this evening. I've got to go back to work now.
Logged
Parity
Space Explorer
***
Offline Offline

Posts: 7


« Reply #5 on: April 19, 2006, 03:40:53 PM »

Hi, this is Parity again.

I tried what you suggested, but it doesn't seem to have worked. How long does it take to work? Then I read in another thread that you have to make the robots.txt file in Notepad. I've now done that, but I don't know how to upload the Notepad file into my public_html directory. The Notepad file doesn't show up with my other files on the left-hand side of WS-FTP LE.

Thanks for your help.
Logged
bryantrv
Jabba the Hutt
*****
Offline Offline

Posts: 575


« Reply #6 on: April 19, 2006, 03:45:45 PM »

The robots.txt file only keeps future crawls out; it does not remove pages that have already been crawled (indexed).
You need to look at http://www.google.com/webmasters/remove.html for info on removing already crawled stuff.
Logged

Parity
Space Explorer
***
Offline Offline

Posts: 7


« Reply #7 on: April 19, 2006, 04:07:03 PM »

Thanks. I finally located and uploaded my Notepad file. I'll just have to be patient until the next crawl, because I don't know how to do meta tags.
Logged
easycall
Intergalactic Cowboy
*****
Offline Offline

Posts: 68



WWW
« Reply #8 on: April 20, 2006, 12:35:04 PM »

Since the pages are already indexed, putting up a robots.txt file will not remove them.  It will just keep the bots from re-indexing them.  You need to use Google's remove tool to get the pages removed, but be very careful or you might accidentally get your whole site removed.
Logged

Colleen
http://www.easycall.net/
--If you don't believe in something, you'll fall for anything.
Martijn
Intergalactic Superstar
*****
Offline Offline

Posts: 180


WWW
« Reply #9 on: April 20, 2006, 12:57:51 PM »

Google will remove them at the next crawl. Bots always scan for robots.txt and follow the rules. This is actually the only way to be pretty sure the listings will be removed.
Logged
Tom Veik
Intergalactic Cowboy
*****
Offline Offline

Posts: 62



« Reply #10 on: April 20, 2006, 06:19:51 PM »

My problem is that I have many outdated dead links that are still in the Google index that have been dead for close to four months now.  That Google help page says that dead links are removed automatically.  Maybe, but it sure is a slow process.  I get hundreds of 404 errors from search engines each week trying to index my dead links.
Logged
bryantrv
Jabba the Hutt
*****
Offline Offline

Posts: 575


« Reply #11 on: April 21, 2006, 06:50:33 AM »

You do have a custom 404 page set up, don't you? One with navigation leading back in to the main site.

I changed a bunch of URLs a while ago (from .htm to .html) and put up permanent (301) redirects, which didn't take long to propagate.
Logged

Tom Veik
Intergalactic Cowboy
*****
Offline Offline

Posts: 62



« Reply #12 on: April 21, 2006, 06:31:11 PM »

Quote: "You do have a custom 404 page set up, don't you? One with navigation leading back in to the main site."
Yes on both counts.  And it does properly return the 404 error in the header by the way.
« Last Edit: April 21, 2006, 06:39:30 PM by tlveik » Logged
easycall
Intergalactic Cowboy
*****
Offline Offline

Posts: 68



WWW
« Reply #13 on: April 21, 2006, 10:24:07 PM »

Recently Google tried to index a page that I haven't had for about 4 years. It is a problem that many have reported on Webmasterworld.com. When I find Google looking for a page that doesn't exist, I usually put up a 301 redirect to either the new page or one that is similar. That solves the problem as long as you keep the 301 redirect. I had that redirect up 4 years ago but took it down sometime since then, and Google still went looking for the page again just this month. Very weird.
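
To make the difference between a 301 and a 404 concrete, here is a rough Python sketch of a tiny test server that answers retired paths with a permanent redirect and unknown paths with a 404. It is only an illustration of the status codes under made-up assumptions: the paths in `REDIRECTS` and `LIVE_PAGES`, and the names `RedirectHandler` and `make_server`, are hypothetical. On a shared host you would normally set redirects up in the web server's own configuration rather than run something like this.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical mapping of retired paths to their replacements.
REDIRECTS = {"/old-page.htm": "/old-page.html"}

# Hypothetical pages that still exist.
LIVE_PAGES = {"/old-page.html": b"<html><body>New page</body></html>"}


class RedirectHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        if self.path in REDIRECTS:
            # 301 tells the crawler the move is permanent and where to go.
            self.send_response(301)
            self.send_header("Location", REDIRECTS[self.path])
            self.send_header("Content-Length", "0")
            self.end_headers()
        elif self.path in LIVE_PAGES:
            body = LIVE_PAGES[self.path]
            self.send_response(200)
            self.send_header("Content-Type", "text/html")
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)
        else:
            # Anything else is reported as missing.
            self.send_error(404)

    def log_message(self, format, *args):
        # Keep the console quiet for this sketch.
        pass


def make_server(port=0):
    """Create the server on localhost; port 0 lets the OS pick a free port."""
    return HTTPServer(("127.0.0.1", port), RedirectHandler)


if __name__ == "__main__":
    server = make_server(8000)
    print("Serving on http://127.0.0.1:8000 (Ctrl+C to stop)")
    server.serve_forever()
```

Requesting `/old-page.htm` from this server gets a 301 pointing at `/old-page.html`, while a path that is in neither table gets a 404.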
Logged

Parity
Space Explorer
***
Offline Offline

Posts: 7


« Reply #14 on: April 27, 2006, 12:02:44 PM »

Hi, I'm the one who started this thread. Can you believe that Google now shows a link to this thread? Obviously, they've done a crawl since I posted it, but my robots.txt file seems not to have worked: I'm still getting unwanted listings. I put it into Notepad and uploaded it into my public_html directory as an ASCII file. I have only one page (unless the forum counts as a page). Would redirecting even help here? Exactly how do you do it?
Logged