Why do you care if archive.org is indexing your website? They won't send you any real traffic. Be more concerned about getting listed in google, yahoo and msn. If you are not getting indexed there - then you have a problem.
If you are new to domains and looking to buy, sell and learn about domains then you have come to the right place. DNForum is the largest domain name community on the internet and continues to grow every day. There are over 105,000 domainers on DNForum doing everything from buying domains, selling domains, learning about domains and discussing domains. Take a minute and Register.
Register Today on DNForum IT'S FREE!I was looking for the history of a .COM that I own. (I bought it recently and put up a blog. I went to http://www.archive.org trying to make sure it doesn't have a negative history for me to clean up.)
Their site tells me it cannot find any info on my site due to a Robots.txt file. It says "We're sorry, access to basicpolitics.com has been blocked by the site owner via robots.txt. Their FAQ page says the robots.txt file forbids their web crawler from archiving any domain with such a file.
It offers me a link to view the Robots.txt file, but when I click on it, I get this, "Failed Connection. We're sorry. Your request failed to connect to our servers ."
I have searched all of the files on my server and do not find such a file. I never placed one there and I don't think anyone else has, either.
A previous owner may have had a Robots.txt file that is causing them to continue by-passing my site when their spiders are wandering about.
I sent them an e-mail to wayback@archive.org asking them how to get my site off their list of sites they don't archive. It has been several days and I have had no response.
Has anyone else encountered this?
How do I get my site off the list of what they don't archive?
I am the registrant of the site now, and I wish to get it archived, and otherwise taken notice of so I can drive more traffic to it.
Thank you in advance for your help.
toria
Last edited by biggedon; 02-06-2008 at 08:34 AM.
Selling: TrickApps.com, JobsRate.com, KushKing.com, FreeBakingRecipes.com, GridJobs.com, DailyGreenTip.com, DispensaryDirectories.com, DarkTrip.com, DrugPens.com, AllCoupons.co, BankCreditCards.co, ConsumerNews.co, CheekyGit.com, DiskTube.com, DITY.org
Why do you care if archive.org is indexing your website? They won't send you any real traffic. Be more concerned about getting listed in google, yahoo and msn. If you are not getting indexed there - then you have a problem.
Zombie Movie Bong of the Dead - Get it on DVD or via Digital Download Today! ~ "This is a sure winner." - Tommy Chong
How can I tell if google, yahoo and msn are indexing my site?
thanks,
toria
Selling: TrickApps.com, JobsRate.com, KushKing.com, FreeBakingRecipes.com, GridJobs.com, DailyGreenTip.com, DispensaryDirectories.com, DarkTrip.com, DrugPens.com, AllCoupons.co, BankCreditCards.co, ConsumerNews.co, CheekyGit.com, DiskTube.com, DITY.org
Last edited by dcristo; 02-05-2008 at 09:02 PM. Reason: Automerged Doublepost
go to each and type in your domain name with ext -i.e.- mydomain.com
You can also sign up at each for the webmaster tools. That way you can verify ownership, upload sitemaps, and check your indexing and backlinks etc.
https://www.google.com/webmasters/sitemaps/
http://webmaster.live.com/
https://siteexplorer.search.yahoo.com
Zombie Movie Bong of the Dead - Get it on DVD or via Digital Download Today! ~ "This is a sure winner." - Tommy Chong
Last edited by toria; 02-05-2008 at 09:10 PM. Reason: Automerged Doublepost
Selling: TrickApps.com, JobsRate.com, KushKing.com, FreeBakingRecipes.com, GridJobs.com, DailyGreenTip.com, DispensaryDirectories.com, DarkTrip.com, DrugPens.com, AllCoupons.co, BankCreditCards.co, ConsumerNews.co, CheekyGit.com, DiskTube.com, DITY.org
No worries. The reason I only gave the google site command is because you will probably find it will account for more then 90% of your search engine traffic.
Yahoo Site Explorer is also helpful:
http://siteexplorer.search.yahoo.com/
Just on dcristo's point, I've noticed I get different results for:
site:www.blah.com
site:blah.com
site:www.blah.com/folder1
site:blah.com/folder2
...to be sure, to be sure....
HTH
creat a robots.txt file and save it in the main directory of your site. the file should just be 2 lines:
User-agent: *
Disallow:
Network-Tools.com - Network Tools since 1998
upload a sitemap to webmaster tools, and get a PR3 or higher link or 2, and your site will be indexed in a couple of days.
PM me if you need a link I can give you one till your indexed
WOW! I really feel like a newbie now!
I know what a sitemap is and I downloaded a sitemap tool for Wordpress (what I'm using to create my blog). I haven't tried to use the sitemap tool yet, though.
But I don't know what it means to:
-- upload a sitemap to webmaster tools,
-- get a PR3 or higher link or 2
Sorry, those terms are not yet in my brain/database for reference. Can you explain a bit more?
Thanks for your kindness and patience with me.
Best,
Toria
Selling: TrickApps.com, JobsRate.com, KushKing.com, FreeBakingRecipes.com, GridJobs.com, DailyGreenTip.com, DispensaryDirectories.com, DarkTrip.com, DrugPens.com, AllCoupons.co, BankCreditCards.co, ConsumerNews.co, CheekyGit.com, DiskTube.com, DITY.org
sure, sign up for a google webmaster tools account, add your domain and verify it,
Then create a sitemap of your site using one of the free tools on the net, its basically a list of URL's of your site so it makes it easier for google to spider.
Upload the sitemap to your webmaster tools account.
If you have links to your site from other sites that google deems as important (high PR or Page Rank), you will get indexed alot quicker.
Selling: TrickApps.com, JobsRate.com, KushKing.com, FreeBakingRecipes.com, GridJobs.com, DailyGreenTip.com, DispensaryDirectories.com, DarkTrip.com, DrugPens.com, AllCoupons.co, BankCreditCards.co, ConsumerNews.co, CheekyGit.com, DiskTube.com, DITY.org
Bookmarks