Rank: Newbie Groups: Member
Joined: 10/7/2007 Posts: 2 Points: 6 Location: UK
|
|
|
|
|
Rank: Administration Groups: Administration
Joined: 1/31/2007 Posts: 380 Points: 539 Location: Chicago, IL
|
I scanned your home page and your URLs have semicolons separating the page/folder from the querystring values. I have never seen this before and it's causing bad URLs to be stored and spider by the application. For example: <a href="/SHOPPING-CATEGORIES;jsessionid=ac112b2a1f435623313f67734f95b4c1df1e8bfdee96.e3eSbNyQc3mLe34Pa38Ta38Pahb0" class="topnavlink">Shop for Plants</a>
Are you putting "jsessionid" in the ignore querystring parameters field? Try changing it to ";jsessionid"?
If this is a standard method of appending querystring parameters, we might need to update the software to account for it. Please let us know how it goes and we'll do some of our own testing.
Thanks, Brian iArchitect
|
Rank: Newbie Groups: Member
Joined: 10/7/2007 Posts: 2 Points: 6 Location: UK
|
Thanks Brian
I was kind of afraid of that. We use an ERP app called Netsuite (www.netsuite.com). This is a "Netsuite" problem. If your crawler emulates Googlebot, then the jsessionids are 301 redirected to user friendly urls. Otherwise the sitemap hets stuffed with jsessionids. But I don't see why that should return 404's?
Best Julian
|
Rank: Administration Groups: Administration
Joined: 1/31/2007 Posts: 380 Points: 539 Location: Chicago, IL
|
Have you changed your website since the last message? When I try to spider it now, it only finds the home page...no internal links.
|
Rank: Administration Groups: Administration
Joined: 1/31/2007 Posts: 380 Points: 539 Location: Chicago, IL
|
Locking topic as no response in some time.
|