Valuate Domain Names
DNForum - Domain Sales, Domain Forum, Domain Appraisals, Domain Registrars
HomeRegisterMembershipsGetting StartedDomain Tools Domain EbooksSEO Software Domain Resellers Advertise

Go Back   DNForum - Domain Sales, Domain Forum, Domain Appraisals, Domain Registrars > Content Development > Website Development and Design Discussion > Website Reviews
Register Search Today's Posts Mark Forums Read

Reply
 
LinkBack Thread Tools Display Modes
Old 05-02-2009, 10:53 PM   #61 (permalink)
Platinum Lifetime Member
 
kengreenwood's Avatar
 
Name: That shouldn't be too hard to figure out...
Last Online: 10-29-2009 08:46 AM
iTrader: (2)
Join Date: May 2006
Posts: 377
DNF$: 4,437
Location: Tampa
Country:


Quote:
Originally Posted by Acro View Post
That would probably kill the server this killer app is running on
Yes - Theo is correct. Everyone needs to remember that there are 100 million records being searched through. And the searches, for the most part, are pretty quick. That's because I have many, many tables in the database and point the search to the correct table based on what it starts with or what it ends with... (that's thousands of tables). To have it work with "contains" is extremely difficult on a dataset this large. It works on the dropping domains because the dataset is much smaller... on the active domains - probably not gonna happen.
__________________
KJG
OneWorldMedia, ZFBot

Last edited by kengreenwood; 05-02-2009 at 11:05 PM..
kengreenwood is offline   Reply With Quote
Sponsored Ads
Old 05-21-2009, 11:53 PM   #62 (permalink)
Platinum Lifetime Member
 
kengreenwood's Avatar
 
Name: That shouldn't be too hard to figure out...
Last Online: 10-29-2009 08:46 AM
iTrader: (2)
Join Date: May 2006
Posts: 377
DNF$: 4,437
Location: Tampa
Country:


Update - I've overhauled the entire zfbot process... I'm now just loading a single, VERY large and indexed, MySQL table as opposed to 1500 or so tables. The indexes take care of what the multiple tables were handling.

Since I'm no longer processing data all day long, I've changed the scroll from a status to the DNJournal RSS Feed

Still working on further functionality and updates...

http://www.zfbot.com

KJG
__________________
KJG
OneWorldMedia, ZFBot
kengreenwood is offline   Reply With Quote
Old 05-22-2009, 01:30 AM   #63 (permalink)
DNF Member
No Avatar
 
Last Online: 11-03-2009 10:17 PM
iTrader: (22)
Join Date: Apr 2005
Posts: 463
DNF$: 4,957
Location: ysl
Country:


Great Service,
thank you!
__________________
NameDrives.com Leading The Next Boom


Somatic is offline   Reply With Quote
Old 05-22-2009, 12:46 PM   #64 (permalink)
Platinum Lifetime Member
 
kengreenwood's Avatar
 
Name: That shouldn't be too hard to figure out...
Last Online: 10-29-2009 08:46 AM
iTrader: (2)
Join Date: May 2006
Posts: 377
DNF$: 4,437
Location: Tampa
Country:


Finally got the scripts down so that I can cron everything up. The entire process of downloading the com, net and org zonefiles, cleaning up the data in unix, bulk loading it into the indexed MySQL tables and doing updates takes about 4 hours. Not bad for 100 million records.
__________________
KJG
OneWorldMedia, ZFBot
kengreenwood is offline   Reply With Quote
Old 05-22-2009, 01:09 PM   #65 (permalink)
Bloody lovely
 
Acro's Avatar
 
Last Online: Today 05:49 PM
iTrader: (393)
Join Date: Feb 2004
Posts: 23,740
DNF$: 3,447
Location: USA
Country:




Awesome upgrades. Any time now, it will also brew coffee
__________________

DomainGang.com - Domainers' Most Awesome News Source
Acroplex - Web & Graphics
Acro.net - My Blog
Acro is online now   Reply With Quote
Old 05-22-2009, 01:29 PM   #66 (permalink)
Wordpress Guru
 
tonyfloyd's Avatar
 
Name: Tony
Last Online: Today 07:41 AM
iTrader: (58)
Join Date: Apr 2007
Posts: 3,499
DNF$: 10,787
Location: New York
Country:


Ken,

Sent u a PM a few days ago....any thoughts?

Thanks!
Tony
__________________

Web Design | Italian Recipes | Keyword Links
tonyfloyd is offline   Reply With Quote
Old 05-28-2009, 10:28 PM   #67 (permalink)
Platinum Lifetime Member
 
kengreenwood's Avatar
 
Name: That shouldn't be too hard to figure out...
Last Online: 10-29-2009 08:46 AM
iTrader: (2)
Join Date: May 2006
Posts: 377
DNF$: 4,437
Location: Tampa
Country:


All - you should see a marked improvement in the performance of the ZFBot application. I've added a bunch of indexes on the large zone file table (over 100 million records on that table) and the searches are coming back quicker. I've noted that any search that yields a large number of domains tends to never come back - and that large number of records value is dependent on the memory on your local machine. There is no limitation on the number of records the datagrid can hold - it's just the memory on your machine. For example, on my laptop the queries that yield over 150,000 domains never come back for me... it may be higher or lower for each of you. I'm working on adding paging functionality to the datagrid that displays the resulting domains so that I can avoid the local machine memory issue and return an unlimited number of domains. For now, keep this in mind and just narrow down your searches by using the options of domain length, excluding dashes, numbers etc... and you'll be fine.

On a side note - I apologize for being out of pocket for the last couple weeks - I just sold my house and moved to a new one... hired a moving company for the big stuff but I moved a lot of stuff myself ... every bone/muscle in my body is aching... including my (typing) fingers. ouch.

5/29/09 11:10 AM - optimizing the large zfbot table right now ... queries will run endlessly until it's complete.. sorry!

OK - everything is kosher again. I removed the auto-search on load and replaced it with a welcome swf...

check it out... http://www.ZFBot.com
__________________
KJG
OneWorldMedia, ZFBot

Last edited by kengreenwood; 05-29-2009 at 03:18 PM.. Reason: Automerged Doublepost
kengreenwood is offline   Reply With Quote
Old 05-30-2009, 04:26 PM   #68 (permalink)
Platinum Lifetime Member
 
kengreenwood's Avatar
 
Name: That shouldn't be too hard to figure out...
Last Online: 10-29-2009 08:46 AM
iTrader: (2)
Join Date: May 2006
Posts: 377
DNF$: 4,437
Location: Tampa
Country:


I added the ability to search for CVCV and CVVC or both within the "dropping domain" search. You'll see that the "Parking Breakdown" button turn into a dropdown when you check the "It's Dropping" checkbox. And if you select CVCV, CVVC or CVCV/CVVC, most of the options on the top will become disabled (the starts/end/contains dropdown, the domain text input box, the dash/number filters and the length filters). Once you uncheck either the "It's Dropping" checkbox or change the dropdown value from CVCV,CVVC or CVCV/CVVC, all of those filters will be re-enabled.
__________________
KJG
OneWorldMedia, ZFBot
kengreenwood is offline   Reply With Quote
Old 05-30-2009, 04:28 PM   #69 (permalink)
Bloody lovely
 
Acro's Avatar
 
Last Online: Today 05:49 PM
iTrader: (393)
Join Date: Feb 2004
Posts: 23,740
DNF$: 3,447
Location: USA
Country:




Somehow, the day that it *will* be making coffee too is near!
__________________

DomainGang.com - Domainers' Most Awesome News Source
Acroplex - Web & Graphics
Acro.net - My Blog
Acro is online now   Reply With Quote
Old 05-30-2009, 04:29 PM   #70 (permalink)
Platinum Lifetime Member
 
kengreenwood's Avatar
 
Name: That shouldn't be too hard to figure out...
Last Online: 10-29-2009 08:46 AM
iTrader: (2)
Join Date: May 2006
Posts: 377
DNF$: 4,437
Location: Tampa
Country:


by the way, the main table is being updated as i type this... queries will have to wait for a few minutes...

ok - all done... and Theo - it's gonna grind the coffee beans too ...

Might as well add a few more to the dropping domains filter...

For those of you who like looking for 4 letter domains - Now there is selection for cvcv, cvvc, vcvc, vccv and cvcv/cvvc/vcvc/vccv all together.
__________________
KJG
OneWorldMedia, ZFBot

Last edited by kengreenwood; 05-30-2009 at 10:03 PM.. Reason: Automerged Doublepost
kengreenwood is offline   Reply With Quote
Old 06-02-2009, 05:33 PM   #71 (permalink)
CrossLogix.com
 
copper's Avatar
 
Last Online: Today 04:44 PM
iTrader: (65)
Join Date: Mar 2006
Posts: 2,238
DNF$: 2,167
Location: Matthews, NC. U


I hope adding more functions won't slow it down too much
__________________

Domain Names For Sale
copper is offline   Reply With Quote
Old 06-02-2009, 06:05 PM   #72 (permalink)
 
aZooZa's Avatar
 
Name: Dale Hubbard
Last Online: Yesterday 12:09 PM
iTrader: (45)
Join Date: Jan 2003
Posts: 5,868
DNF$: 5,845
Location: Exeter, England
Country:


Like spltting it up into a-z
__________________
UK Drop Catching Services: Dropsystem.co.uk
New! Canada TBR Drop Catching: Dropping.ca
New! QUALITY MiniSites: NOTsoMINI.com
aZooZa is offline   Reply With Quote
Old 06-09-2009, 12:04 PM   #73 (permalink)
Platinum Lifetime Member
 
kengreenwood's Avatar
 
Name: That shouldn't be too hard to figure out...
Last Online: 10-29-2009 08:46 AM
iTrader: (2)
Join Date: May 2006
Posts: 377
DNF$: 4,437
Location: Tampa
Country:


Update

If I start seeing a significant performance drop off, I'll certainly remove some functionality. But for now, it's full speed ahead...

I'm almost done with the portfolio search - It's going to be a major upgrade and it will allow users to load up their entire portfolio of domains and get a complete result set of matches - begins with, ends with and exact.

Here are a couple of the screen shots - still a work in progress but getting close:
First column is the users domain (in this case, mine), second column is the matched domain, then you've got Google PageRank, number of Google indexed pages, number of Google backlinks, the Alexa rank, whether it's in DMOZ and the match type. All of the columns are sortable and I'm currently working on integrating the same filters that you use on the main search into this view. (In this view I happened to sort it by the Alexa rank). Every column except the first (the users domain) is related to the matched domain. This gives you a quick way to see any matched domains that are likely to have full blown websites associated with them versus parked sites.



This view shows you the domains that matched that are also about to drop. This can be helpful if you're looking to pick up and similar domains to yours... The date it's dropping becomes visible when you click the "dropping" checkbox in this view.



The process of getting the matches for a set of 300 of my domains took about 5 minutes so it'll be something that I'll give the users the ability to cue up and run on a regular basis.... and I'm going to add to the view a column that displays the last date updated on each domain so you can see any newly matched domains. Next steps beyond this are to send email alerts for specific saved queries.

If you want to submit functionality requests - just PM me.
__________________
KJG
OneWorldMedia, ZFBot
kengreenwood is offline   Reply With Quote
Old 06-09-2009, 01:34 PM   #74 (permalink)
Bloody lovely
 
Acro's Avatar
 
Last Online: Today 05:49 PM
iTrader: (393)
Join Date: Feb 2004
Posts: 23,740
DNF$: 3,447
Location: USA
Country:




I must say that you don't need to give Ken any ideas out of kidding; he will implement them So now ZFBot makes coffee as well

The output is also impressive. It's an expandable Excel document that simplifies the human (visual) scanning of the results.
__________________

DomainGang.com - Domainers' Most Awesome News Source
Acroplex - Web & Graphics
Acro.net - My Blog
Acro is online now   Reply With Quote
Old 06-09-2009, 03:03 PM   #75 (permalink)
CrossLogix.com
 
copper's Avatar
 
Last Online: Today 04:44 PM
iTrader: (65)
Join Date: Mar 2006
Posts: 2,238
DNF$: 2,167
Location: Matthews, NC. U


That's really impressive stuff!

How about giving users options of choosing to display or
no display items like PR, GIP, GBL, Alexa, DMOZ and Drop Date?

I am thinking it may speed up the processing speed if
users choose not to display some items and also reduce server load?

BTW, I prefer ESPRESSO!
__________________

Domain Names For Sale
copper is offline   Reply With Quote
Old 06-09-2009, 08:02 PM   #76 (permalink)
Platinum Lifetime Member
 
kengreenwood's Avatar
 
Name: That shouldn't be too hard to figure out...
Last Online: 10-29-2009 08:46 AM
iTrader: (2)
Join Date: May 2006
Posts: 377
DNF$: 4,437
Location: Tampa
Country:


Quote:
Originally Posted by copper View Post
That's really impressive stuff!

How about giving users options of choosing to display or
no display items like PR, GIP, GBL, Alexa, DMOZ and Drop Date?

I am thinking it may speed up the processing speed if
users choose not to display some items and also reduce server load?

BTW, I prefer ESPRESSO!
Yeah, the performance is going to be an issue with the stats I'm afraid. Here's the deal (and if anyone has any ideas for work arounds, I'm all ears)... To get the matches is no problem - takes a reasonable amount of time even on large data sets. I ran through a list of domains from one of of the domainers here of around 1,100 domains and it resulted in a matched list of 450,000 in around 10 minutes. But, getting the PR, GIP, GBL, Alexa, and DMOZ values for those domains takes around 5 seconds per domain (via a PHP script I am using). That may not sound like a lot of time but on a record set of 450,000 - it would take nearly an entire month to complete. That's because in this set of domains, there were a handful that each had well over 10,000 matches and a few had nearly 50,000 matches. Nice domain names to say the least. Even on a small set of data - the 200 domains I ran through produced around 10,000 matches and the processing time for the stats would be almost 6 days. Not acceptable obviously. I'm going to have to tinker around with which stats are most important and how I want to display them... it may need to be a button that you press on each domain and then it retrieves those stats. Unless I get some script that is able to run SIGNIFICANTLY faster. Something like processing 5 domains a second would be acceptable - that would result in the 450,000 matches being done in a day and a more reasonable set of around 50,000 in a couple hours. Anyway, I'll still be getting the portfolio match with the dropping domains completed regardless of the stats... stay tuned.

Just finished doing some research - technically, it's against Google's TOS to run an automated PR checker on your server. It takes up Google's bandwidth and they frown upon it. There are obviously tons of scripts out on the net you can find to get the PR - many of them paid for scripts and programs such as the domainpunch.com tool. However, you can probably get around the "automated" terminology because the user is technically kicking off the request... therefore it's manual. The process I'm currently running would probably be deemed automated since it's a single script going out and grabbing tens of thousands of PR's. Sooooo, I'll probably go the route of getting the domain matches for the portfolio and allowing a user to click the domain to get the PR, GIP, GBL, etc... once the data has been requested, I can store it locally and display it from that point forward without having to recall it through Google. There would need to be a "refresh" button the user can click to update the data again at some point...
__________________
KJG
OneWorldMedia, ZFBot

Last edited by kengreenwood; 06-09-2009 at 09:38 PM.. Reason: Automerged Doublepost
kengreenwood is offline   Reply With Quote
Old 06-11-2009, 04:19 PM   #77 (permalink)
Platinum Lifetime Member
 
kengreenwood's Avatar
 
Name: That shouldn't be too hard to figure out...
Last Online: 10-29-2009 08:46 AM
iTrader: (2)
Join Date: May 2006
Posts: 377
DNF$: 4,437
Location: Tampa
Country:


I added a chat room on ZFbot.com ... you'll see a radio button to access it above the stats grid on the right...
__________________
KJG
OneWorldMedia, ZFBot
kengreenwood is offline   Reply With Quote
Old 06-11-2009, 04:41 PM   #78 (permalink)
CrossLogix.com
 
copper's Avatar
 
Last Online: Today 04:44 PM
iTrader: (65)
Join Date: Mar 2006
Posts: 2,238
DNF$: 2,167
Location: Matthews, NC. U


Chat room?
Where is my espresso :(
__________________

Domain Names For Sale
copper is offline   Reply With Quote
Old 06-24-2009, 11:21 PM   #79 (permalink)
Platinum Lifetime Member
 
kengreenwood's Avatar
 
Name: That shouldn't be too hard to figure out...
Last Online: 10-29-2009 08:46 AM
iTrader: (2)
Join Date: May 2006
Posts: 377
DNF$: 4,437
Location: Tampa
Country:


I made some changes to the zfbot indexes and query code... domain searches are coming back MUCH more quickly now.
__________________
KJG
OneWorldMedia, ZFBot
kengreenwood is offline   Reply With Quote
Old 06-26-2009, 11:54 AM   #80 (permalink)
Platinum Lifetime Member
 
kengreenwood's Avatar
 
Name: That shouldn't be too hard to figure out...
Last Online: 10-29-2009 08:46 AM
iTrader: (2)
Join Date: May 2006
Posts: 377
DNF$: 4,437
Location: Tampa
Country:


Updates... 06/26/09

Some minor updates and issues:

1. I added the ability to search for CVCVC and VCVCV dropping domains.
2. I added code to restrict all special characters in the search (~`!@#$%^&*()_+=[]{}|\:;"',.<>/? and space)
3. Nearly done with the registration and portfolio upload/match. Just need to create the form to allow users to upload and then maintain their domains. Once domains are uploaded, there is a script that runs every half hour that will check for matching domains for any domain that hasn't yet been matched. All already matched domains will be re-checked for changes nightly. Users will have the ability to see matching domain changes over the last 24 hours, 7 days and 30 days or can just pull up all matching domains. The nice thing about having the ability to see changes to matches is that you can see any NEW matches or any matches that DROPPED (meaning that there was a matching domain but that the matching domain has dropped - this could because the domain is actually dropping or because the name server was removed from the domain temporarily). The current columns in the matching domains datagrid in the portfolio view are: The 3 image links for website, archive.org and snapnames backorder, your domain name, the matching domain name, the type of match (begin, end or exact), the change type (New Match, or Match Dropped) and the last column is whether or not the domain is actually dropping (yes/no).

The matches can be exported to Excel and I've got a macro that I can send users that you just click ctrl-x once the spreadsheet opens and it cleans up and formats the imported data with html links and subtotals, etc.

Here's a screenshot of the portfolio matches form (this example just shows changes over the last 24 hours):


Issues I've found in testing so far:

1. The deeplinking isn't working for some reason in Firefox and IE. It's working in Safari and Chrome.
2. I've noticed a weird issue with the search box in Chrome (and it may be related to the deep linking) where you end up not being able to remove or add characters sporatically...
3. If any tables are locked due to updates running, some of the browsers return an error pop up... not a big deal ...

If anyone finds any other issues, please let me know about them...

Tuned up the queries even more... quicker and quicker now. And I added logic within the query and app that fires off an error when the query results exceed 150,000 records. You'll see a message that says "Over 150,000 results match this query... " etc...
__________________
KJG
OneWorldMedia, ZFBot

Last edited by kengreenwood; 06-26-2009 at 11:26 PM.. Reason: Automerged Doublepost
kengreenwood is offline   Reply With Quote
Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On



All times are GMT -5. The time now is 06:09 PM.
Copyright @2001-2009 DNForum.com