Welcome Guest Search | Active Topics | Members | Log In | Register

Sitemap Generator 4.0 adds non-existing URL's Options
MacGuiver
Posted: Friday, February 23, 2007 5:14:04 PM

Rank: Member
Groups: Member

Joined: 2/23/2007
Posts: 21
Points: 63
Location: Ede, The Netherlands
Hi Brian,

Congratulations on finishing Sitemap Generator 4.0, it looks/functions a lot better than v3.0

After using Sitemap Generator 4.0 (updated download) for 2 days I have found a couple of interesting errors, here's the first...

An example: My website contains the following files:

root/about_us.htm
root/print/about_us_print.htm

After spidering my website, www.guiver-freeman.com, Sitemap Generator 4.0 adds the non-existing URL

root/print/about_us.htm

There is no such file as "about_us.htm" in the print folder!
NOTE: This only occurs in the Google sitemap; the Yahoo! sitemap file "urllist.txt" does not contain this error.

This error adds about 30 non-existent URL's to my Google sitemap.
Any ideas?

regards,
MacGuiver




regards,
MacGuiver
Sponsor
Posted: Friday, February 23, 2007 5:14:04 PM
Get your Sitemap Generator license today! http://www.keylimetie.com/Checkout/Quick-PayPal/
KeyLimeTie
Posted: Saturday, February 24, 2007 1:02:51 PM
Rank: Administration
Groups: Administration

Joined: 1/31/2007
Posts: 440
Points: 646
Location: Chicago, IL
Sounds like a bug. I will investigate immediately and get back to you within a few days. Thanks.
KeyLimeTie
Posted: Sunday, March 04, 2007 11:27:47 AM
Rank: Administration
Groups: Administration

Joined: 1/31/2007
Posts: 440
Points: 646
Location: Chicago, IL
I will have answer for you on this today.
KeyLimeTie
Posted: Sunday, March 04, 2007 5:43:25 PM
Rank: Administration
Groups: Administration

Joined: 1/31/2007
Posts: 440
Points: 646
Location: Chicago, IL
I wish I had investigated this for you sooner...it only took about 2 minutes to investigate.

Go to page: http://www.guiver-freeman.com/print/sitemap_UK_print.htm
Check out the links for "About us", "Margriet Guiver-Freeman", "Professional Colleagues" and "Freelancers".
They're all invalid. I'm guessing all of the links on this page need a "/" at the beginning.

How did I find this:
1. Run the Spider with "High" messaging on.
2. When it's done, click the "Export messages" button.
3. Open the "export.txt" file located at:
C:\Program Files\iArchitect\iArchitect Sitemap Generator v4.0\sitemaps\<your website>\export.txt
4. Search for the page the spider found that doesn't exist. Once you find it, it will be buried in a bunch of messages. Scroll up in the messages a little until you see something like:

Pulling from queue: http://www.yourwebsite.com/somepath/somewebpage.htm


This line tell you that this page is now being spidered for links. The messages tell you EVERYTHING it is doing.

Sometimes, the page the original bad link was found is also not valid. Just do the same thing for that page (start searching from the top of the messages for that page and see where it was found). Sometimes, you might have to go back 3 or more pages depending on the complexity of your site. Also, the first time you find the page while you're searching might not be the bad link. You need to look at the messages and interpret them a little. It's not hard, just takes a little thought.

If anyone else has this problem, feel free to post the website URL and the bad URL and I'll be happy to investigate it for you and tell you how I figured it out.

Thanks,
Brian
MacGuiver
Posted: Thursday, March 08, 2007 10:28:50 AM

Rank: Member
Groups: Member

Joined: 2/23/2007
Posts: 21
Points: 63
Location: Ede, The Netherlands
Ok, Ok, you're right I just got caught out on another cut&paste screwup - forgot to reset the links to doc relative.
Thanks for the help!


regards,
MacGuiver
KeyLimeTie
Posted: Thursday, March 08, 2007 4:52:55 PM
Rank: Administration
Groups: Administration

Joined: 1/31/2007
Posts: 440
Points: 646
Location: Chicago, IL
No problem...it's good news for me ;)
KeyLimeTie
Posted: Monday, March 12, 2007 11:51:26 AM
Rank: Administration
Groups: Administration

Joined: 1/31/2007
Posts: 440
Points: 646
Location: Chicago, IL
Locking topic as the issue has been resolved.
Users browsing this topic
Guest


Forum Jump
You cannot post new topics in this forum.
You cannot reply to topics in this forum.
You cannot delete your posts in this forum.
You cannot edit your posts in this forum.
You cannot create polls in this forum.
You cannot vote in polls in this forum.

Main Forum RSS : RSS

None
Powered by Yet Another Forum.net version 1.9.1.2 (NET v2.0) - 9/27/2007
Copyright © 2003-2006 Yet Another Forum.net. All rights reserved.
This page was generated in 0.242 seconds.