Spidering urls that don't exist
team
Posted: Tuesday, July 10, 2007 9:31:21 AM
Rank: Newbie
Groups: Member

Joined: 6/24/2007
Posts: 9
Points: 27
In the last couple of days the spider has been running through thousands of URLs that don't exist! My site is broken down into baseball, college, football, etc., and it is requesting things like college/baseball. How is it coming up with these? I have to stop it because it just runs and runs. I have fewer than 1,000 pages, but the count was in the 10,000s when I stopped it!
Thanks
KeyLimeTie
Posted: Tuesday, July 10, 2007 3:00:28 PM
Rank: Administration
Groups: Administration

Joined: 1/31/2007
Posts: 440
Points: 646
Location: Chicago, IL
Does your website pass "sessionid" or some other unique session identifier in the URL?
If so, your website is probably issuing a new session for each page the spider finds and it'll never end.
You need to put that field in the "Ignore Querystring" preference.
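
To illustrate why a session field makes the crawl endless, here is a minimal Python sketch of what ignoring a querystring field amounts to. It is not the Sitemap Generator's actual code; the names `IGNORED_FIELDS`, `normalize_url` and `is_new`, and the example.com URLs, are made up for the example.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

# Hypothetical list of querystring fields to ignore (assumption for this sketch).
IGNORED_FIELDS = {"sessionid"}


def normalize_url(url, ignored=IGNORED_FIELDS):
    """Return the URL with ignored querystring fields and the fragment removed."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in ignored
    ]
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


seen = set()


def is_new(url):
    """Report whether a URL has not been queued yet, comparing normalized forms."""
    key = normalize_url(url)
    if key in seen:
        return False
    seen.add(key)
    return True
```

Without the normalization step, page.htm?sessionid=abc and page.htm?sessionid=xyz count as two different pages, so every visit produces more "new" URLs.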

If you're not sure, either post your website URL here or email it to me and I'll check it out.

Thanks,
Brian
iArchitect
team
Posted: Tuesday, July 10, 2007 3:15:46 PM
You can click on the www on my post and it takes you to our website. Myteamprints.com
I would appreciate you checking this out if you can because I don't know what you are talking about! :)
I will say that it was working just fine for the first couple of weeks we have been running it. Maybe we changed something?
KeyLimeTie
Posted: Tuesday, July 10, 2007 8:10:58 PM
Take a look at:
http://www.myteamprints.com/college/main.htm

You have a link titled "Enter to win a FREE PRINT!" that links to:
http://www.myteamprints.com/college/contest.htm

This page doesn't exist and your web server is redirecting to:
http://www.myteamprints.com/error_docs/not_found.html

This page has bad links, which the application is queueing up to visit.
Multiply the number of links on this page by the number of pages that link to bad pages and you'll see why it's growing so fast.
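
As a rough illustration, a spider can avoid this by never harvesting links from an error page. The Python sketch below is not how the Sitemap Generator works internally; `ERROR_PATHS` and `should_extract_links` are hypothetical names, and the only real value is the not_found.html path quoted above.

```python
from urllib.parse import urlsplit

# Assumption: the site's error page lives at the redirect target quoted above.
ERROR_PATHS = {"/error_docs/not_found.html"}


def should_extract_links(final_url, status_code=200):
    """Return False when the response is an error page whose links should be skipped.

    final_url is the URL after any redirects have been followed.
    """
    if status_code >= 400:
        return False
    return urlsplit(final_url).path not in ERROR_PATHS
```

With a check like this, a request for college/contest.htm that ends up on not_found.html is recorded as a broken link and the links on the error page are never queued.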
team
Posted: Wednesday, July 11, 2007 12:45:14 AM
I fixed those bad links and re-checked the site for problem hyperlinks and found no others. It is still doing this, however. My other site, mypanoramicprints.com, works fine. How can I find the problem? Is there something specific I should put in the query string to omit?
KeyLimeTie
Posted: Wednesday, July 18, 2007 1:16:26 AM
We will check it out ASAP. Sorry for the delay in responding.
KeyLimeTie
Posted: Tuesday, July 24, 2007 12:23:40 AM
I will have a resolution for you on Tuesday, July 24th. Again, sorry for the delay.
KeyLimeTie
Posted: Wednesday, July 25, 2007 11:15:59 PM
I just spidered your entire website and it ran clean.
Please try it again...maybe clear your cache first (on Preferences tab).
Please let me know how it goes.
KeyLimeTie
Posted: Monday, August 13, 2007 12:41:12 PM
Locking topic as issue has been explained.