02-05-2008, 09:41 PM
|
#1 (permalink)
| | Domain Queen
Name: Victoria Last Online: Today 01:24 PM Join Date: Nov 2007
Posts: 414
DNF$: 5,525 Location: ZeroPoint Field
Country: | My domain is not being archived by spiders, crawlers, robots
I was looking for the history of a .COM that I own. (I bought it recently and put up a blog.) I went to http://www.archive.org to make sure it doesn't have a negative history for me to clean up.
Their site tells me it cannot find any info on my site due to a robots.txt file. It says, "We're sorry, access to basicpolitics.com has been blocked by the site owner via robots.txt." Their FAQ page says the robots.txt file forbids their web crawler from archiving any domain with such a file.
It offers me a link to view the robots.txt file, but when I click on it, I get this: "Failed Connection. We're sorry. Your request failed to connect to our servers."
I have searched all of the files on my server and do not find such a file. I never placed one there and I don't think anyone else has, either.
A previous owner may have had a robots.txt file that is causing them to keep bypassing my site when their spiders are wandering about.
I sent an e-mail to wayback@archive.org asking how to get my site off their list of sites they don't archive. It has been several days and I have had no response.
Has anyone else encountered this?
How do I get my site off the list of what they don't archive?
I am the registrant of the site now, and I wish to get it archived, and otherwise taken notice of so I can drive more traffic to it.
Thank you in advance for your help.
toria
__________________ GreenBuildingConstruction.com, TrueHouseValue.com, OneMillionJobs.com, EconomicalHeating.com, BlogForHire.com, CarCareFacts.com, ChicSunglasses.com, AdjustableCanes.com, Tweeko.com, CleanAirAlert.com
Last edited by biggedon; 02-06-2008 at 09:34 AM..
|
| |
02-05-2008, 09:48 PM
|
#2 (permalink)
| | Platinum Lifetime Member
Name: Roy Last Online: Today 04:45 PM Join Date: Jul 2006
Posts: 2,392
DNF$: 100 Location: Canada eh?
Country: | Why do you care if archive.org is indexing your website? They won't send you any real traffic. Be more concerned about getting listed in google, yahoo and msn. If you are not getting indexed there - then you have a problem. |
| |
02-05-2008, 09:59 PM
|
#3 (permalink)
| | Domain Queen
Name: Victoria Last Online: Today 01:24 PM Join Date: Nov 2007
Posts: 414
DNF$: 5,525 Location: ZeroPoint Field
Country: | Thanks - how do I tell?
How can I tell if Google, Yahoo and MSN are indexing my site?
thanks,
toria
|
| |
02-05-2008, 10:01 PM
|
#4 (permalink)
| | Platinum Lifetime Member
Last Online: Today 03:19 PM Join Date: Feb 2005
Posts: 2,666
DNF$: 1 Location: Australia
Country: | Quote:
Originally Posted by toria I was looking for the history of a .COM that I own. (I bought it recently and put up a blog.) I went to http://www.archive.org to make sure it doesn't have a negative history for me to clean up.
Their site tells me it cannot find any info on my site due to a robots.txt file. It says, "We're sorry, access to http://www.basicpolitics.com/ has been blocked by the site owner via robots.txt." Their FAQ page says the robots.txt file forbids their web crawler from archiving any domain with such a file.
It offers me a link to view the robots.txt file, but when I click on it, I get this: "Failed Connection. We're sorry. Your request failed to connect to our servers."
I have searched all of the files on my server and do not find such a file. I never placed one there and I don't think anyone else has, either.
A previous owner may have had a robots.txt file that is causing them to keep bypassing my site when their spiders are wandering about.
I sent an e-mail to wayback@archive.org asking how to get my site off their list of sites they don't archive. It has been several days and I have had no response.
Has anyone else encountered this?
How do I get my site off the list of what they don't archive?
I am the registrant of the site now, and I wish to get it archived, and otherwise taken notice of so I can drive more traffic to it.
Thank you in advance for your help.
toria | Hi toria - Getting listed in archive.org does not drive more traffic to your site. Disregard the previous history of the site and concentrate on future traffic growth and profits. Quote:
Originally Posted by toria How can I tell if google, yahoo and msn are indexing my site?
thanks,
toria | In google.com you use the site command, ie.
Last edited by dcristo; 02-05-2008 at 10:02 PM..
Reason: Automerged Doublepost
|
| |
02-05-2008, 10:04 PM
|
#5 (permalink)
| | Platinum Lifetime Member
Name: Roy Last Online: Today 04:45 PM Join Date: Jul 2006
Posts: 2,392
DNF$: 100 Location: Canada eh?
Country: | Quote:
Originally Posted by toria How can I tell if google, yahoo and msn are indexing my site?
thanks,
toria | Go to each one and type in your domain name with its extension, e.g. mydomain.com
You can also sign up at each for the webmaster tools. That way you can verify ownership, upload sitemaps, and check your indexing and backlinks etc. https://www.google.com/webmasters/sitemaps/ http://webmaster.live.com/ https://siteexplorer.search.yahoo.com |
| |
02-05-2008, 10:06 PM
|
#6 (permalink)
| | Domain Queen
Name: Victoria Last Online: Today 01:24 PM Join Date: Nov 2007
Posts: 414
DNF$: 5,525 Location: ZeroPoint Field
Country: | Thanks! Quote:
Originally Posted by dcristo Hi toria - Getting listed in archive.org does not drive more traffic to your site. Disregard the previous history of the site and concentrate on future traffic growth and profits.
In google.com you use the site command, ie. | Thanks dcristo. I really appreciate those of you who are willing to help us newbies!
Best,
toria Quote:
Originally Posted by whitebark | Thanks Whitebark, you guys are so incredibly helpful!
Best,
toria
Last edited by toria; 02-05-2008 at 10:10 PM..
Reason: Automerged Doublepost
|
| |
02-05-2008, 10:10 PM
|
#7 (permalink)
| | Platinum Lifetime Member
Last Online: Today 03:19 PM Join Date: Feb 2005
Posts: 2,666
DNF$: 1 Location: Australia
Country: | Quote:
Originally Posted by toria Thanks dcristo. I really appreciate those of you who are willing to help us newbies!
Best,
toria | No worries. The reason I only gave the Google site command is that you will probably find it accounts for more than 90% of your search engine traffic.
Yahoo Site Explorer is also helpful: http://siteexplorer.search.yahoo.com/ |
| |
02-05-2008, 11:46 PM
|
#8 (permalink)
| | Platinum Lifetime Member
Last Online: 01-05-2009 12:12 AM Join Date: Aug 2007
Posts: 31
DNF$: 0 Location: DNForum | Google "site:" command
Just on dcristo's point, I've noticed I get different results for:
site: www.blah.com
site:blah.com
site: www.blah.com/folder1
site:blah.com/folder2
...to be sure, to be sure....
HTH |
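For anyone who wants to compare those variations side by side, here is a minimal Python sketch that builds the search URL for each one. The helper name `site_query_url` is made up for illustration, and blah.com is just the placeholder domain from the post above; the only thing it relies on is Google's `site:` operator passed through the usual `q` search parameter.

```python
from urllib.parse import urlencode


def site_query_url(target):
    """Build a Google search URL for a `site:` query (hypothetical helper)."""
    # urlencode percent-encodes the colon and any slashes in the target.
    return "https://www.google.com/search?" + urlencode({"q": f"site:{target}"})


# The www / non-www / folder variations mentioned in the post.
variants = ["www.blah.com", "blah.com", "www.blah.com/folder1", "blah.com/folder2"]
urls = [site_query_url(v) for v in variants]

for url in urls:
    print(url)
```

Open each URL in a browser and compare the result counts; large differences between the www and non-www versions suggest both are being indexed separately.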
| |
02-06-2008, 12:07 AM
|
#9 (permalink)
| | DNF Addict
Last Online: Yesterday 01:58 PM Join Date: Nov 2003
Posts: 1,494
DNF$: 4,952 Location: New Jersey
Country: | Create a robots.txt file and save it in the main directory of your site. The file should be just 2 lines:
User-agent: *
Disallow: |
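If you want to sanity-check what those two lines do before uploading them, here is a small Python sketch using the standard library's `urllib.robotparser`. It is only an illustration: the helper name `is_allowed` is made up, example.com is a placeholder, and the `ia_archiver` user-agent string is an assumption (check archive.org's FAQ for the name their crawler actually uses).

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt, user_agent, url):
    """Return True if robots_txt lets user_agent fetch url (hypothetical helper)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# An empty Disallow value blocks nothing, so every crawler may fetch everything.
ALLOW_ALL = "User-agent: *\nDisallow:\n"
# A lone slash blocks the whole site; this is the kind of rule that keeps crawlers out.
BLOCK_ALL = "User-agent: *\nDisallow: /\n"

# "ia_archiver" is an assumed user-agent name, used here only as an example.
allowed = is_allowed(ALLOW_ALL, "ia_archiver", "http://example.com/")
blocked = is_allowed(BLOCK_ALL, "ia_archiver", "http://example.com/")

print("allow-all file permits crawl:", allowed)
print("block-all file permits crawl:", blocked)
```

The difference between the two files is a single slash, so it is worth double-checking the file you upload.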
| |
02-06-2008, 12:08 AM
|
#10 (permalink)
| | Platinum Lifetime Member
Last Online: Today 03:19 PM Join Date: Feb 2005
Posts: 2,666
DNF$: 1 Location: Australia
Country: | Quote:
Originally Posted by dummyhalf Just on dcristo's point, I've noticed I get different results for:
site: www.blah.com
site:blah.com
site: www.blah.com/folder1
site:blah.com/folder2
...to be sure, to be sure....
HTH | You would get varied results if you have both www and non-www pages indexed. This can be resolved by doing a 301 redirect from one version to the other. |
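As a rough sketch of what that 301 rule does (not how any particular web server implements it), the Python below decides whether a requested URL is on the canonical host and, if not, returns the status code and target for the redirect. The function name `canonical_redirect` and the choice of www.blah.com as the canonical host are assumptions for illustration; in practice you would normally configure this in your web server or hosting control panel.

```python
from urllib.parse import urlsplit, urlunsplit


def canonical_redirect(url, canonical_host):
    """Return (301, new_url) if url is on a non-canonical host, else None."""
    parts = urlsplit(url)
    if parts.netloc.lower() == canonical_host.lower():
        return None  # already on the preferred host, no redirect needed
    # Keep scheme, path, and query; swap only the host name.
    target = urlunsplit(
        (parts.scheme, canonical_host, parts.path or "/", parts.query, parts.fragment)
    )
    return 301, target


redirect = canonical_redirect("http://blah.com/folder1?x=1", "www.blah.com")
no_redirect = canonical_redirect("http://www.blah.com/folder1", "www.blah.com")

print(redirect)
print(no_redirect)
```

Because the redirect is permanent (301), search engines are told to treat the two versions as one address instead of indexing both.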
| |
02-06-2008, 03:49 AM
|
#11 (permalink)
| | DNF Addict
Name: Kris Last Online: 09-22-2008 04:56 AM Join Date: Aug 2006
Posts: 2,089
DNF$: 32 Location: Perth
Country: | Upload a sitemap to webmaster tools, and get a PR3 or higher link or two, and your site will be indexed in a couple of days.
PM me if you need a link; I can give you one till you're indexed. |
| |
02-06-2008, 05:40 AM
|
#12 (permalink)
| | Domain Queen
Name: Victoria Last Online: Today 01:24 PM Join Date: Nov 2007
Posts: 414
DNF$: 5,525 Location: ZeroPoint Field
Country: | Quote:
Originally Posted by VirtualT Upload a sitemap to webmaster tools, and get a PR3 or higher link or two, and your site will be indexed in a couple of days.
PM me if you need a link; I can give you one till you're indexed. | WOW! I really feel like a newbie now!
I know what a sitemap is and I downloaded a sitemap tool for Wordpress (what I'm using to create my blog). I haven't tried to use the sitemap tool yet, though.
But I don't know what it means to:
-- upload a sitemap to webmaster tools,
-- get a PR3 or higher link or 2
Sorry, those terms are not yet in my brain/database for reference. Can you explain a bit more?
Thanks for your kindness and patience with me.
Best,
Toria
|
| |
02-06-2008, 06:08 AM
|
#13 (permalink)
| | DNF Addict
Name: Kris Last Online: 09-22-2008 04:56 AM Join Date: Aug 2006
Posts: 2,089
DNF$: 32 Location: Perth
Country: | Quote:
Originally Posted by toria WOW! I really feel like a newbie now!
I know what a sitemap is and I downloaded a sitemap tool for Wordpress (what I'm using to create my blog). I haven't tried to use the sitemap tool yet, though.
But I don't know what it means to:
-- upload a sitemap to webmaster tools,
-- get a PR3 or higher link or 2
Sorry, those terms are not yet in my brain/database for reference. Can you explain a bit more?
Thanks for your kindness and patience with me.
Best,
Toria |
Sure. Sign up for a Google Webmaster Tools account, add your domain and verify it.
Then create a sitemap of your site using one of the free tools on the net. It's basically a list of your site's URLs, which makes it easier for Google to spider.
Upload the sitemap to your webmaster tools account.
If you have links to your site from other sites that Google deems important (high PR, or PageRank), you will get indexed a lot quicker. |
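To show what a sitemap file actually contains, here is a minimal Python sketch that writes one in the sitemaps.org XML format. It is an illustration only: the function name `build_sitemap` and the example.com URLs are made up, and a WordPress sitemap plugin would normally generate this file for you.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return sitemap XML (as a string) listing each URL in a <loc> element."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url
    return ET.tostring(urlset, encoding="unicode")


# Placeholder pages; replace with the real URLs of your site.
pages = ["http://example.com/", "http://example.com/about/"]
sitemap_xml = build_sitemap(pages)

# Prepend the XML declaration when saving the file as sitemap.xml.
print('<?xml version="1.0" encoding="UTF-8"?>')
print(sitemap_xml)
```

Save the output as sitemap.xml in the main directory of the site, then submit its address in the webmaster tools account.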
| |
02-06-2008, 06:17 AM
|
#14 (permalink)
| | Domain Queen
Name: Victoria Last Online: Today 01:24 PM Join Date: Nov 2007
Posts: 414
DNF$: 5,525 Location: ZeroPoint Field
Country: | Quote:
Originally Posted by VirtualT Sure. Sign up for a Google Webmaster Tools account, add your domain and verify it.
Then create a sitemap of your site using one of the free tools on the net. It's basically a list of your site's URLs, which makes it easier for Google to spider.
Upload the sitemap to your webmaster tools account.
If you have links to your site from other sites that Google deems important (high PR, or PageRank), you will get indexed a lot quicker. | Thanks, Kris. You're so helpful!
Best,
Toria
|
| | |
Copyright @2001-2008 DNForum.com
|