Kryptronic Software Support Forum


#1 07-09-2008 19:15:22

wyattea
Member
Registered: 01-07-2006
Posts: 1650

seo and non-seo urls - duplicate content issue

I'm logged into Google's Webmaster Tools and Google is INDEXING non-SEO URLs.  It was brilliant thinking to have the cart display two different kinds of URLs instead of only showing SEO URLs like the ats mod and every other SEO-enabled cart in the world.

***this shows the same page indexed by google - SEO URL and NON-SEO URL***
Little Boy Blue - Baby Gift Delivery - GrenvilleStation.com
/gift/baby-gift-basket-little-boy-blue-0011500.php
/index.php?app=ccp0&ns=prodshow&ref=baby-gift-basket-little-boy-blue-0011500&sid=53yk51rso495095447jg1qi8nte22vo6

***SAME THING***
Budding Leaf - New Baby Baskets in Toronto - GrenvilleStation.com
/gift/baby-baskets-toronto-budding-leaf-007900.php
/index.php?app=ccp0&ns=prodshow&ref=baby-baskets-toronto-budding-leaf-007900&sid=53yk51rso495095447jg1qi8nte22vo6

Baby Cakes - Diaper Cakes in Canada - GrenvilleStation.com
/gift/diaper-cake-baby-cakes-0011000.php
/index.php?app=ccp0&ns=prodshow&ref=diaper-cake-baby-cakes-0011000&sid=53yk51rso495095447jg1qi8nte22vo6

Whoever is thinking of buying CCP6, please realize you either keep SEO URLs disabled or PLAN to have possible Google ranking issues because of duplicate content... unless there is a robots.txt 'workaround' that won't affect spidering.

James...

Last edited by wyattea (07-09-2008 19:15:35)

Offline

 

#2 07-10-2008 00:21:25

dh783
Member
From: Avondale, Arizona
Registered: 04-06-2005
Posts: 6233
Website

Re: seo and non-seo urls - duplicate content issue

James,

I think you have another problem, as the second listing of each of these products has an sid in the URL, which should not be there. I have not seen any of the items from my site that Google has indexed with sids in them.

John

Offline

 

#3 07-10-2008 15:57:16

theblade24
Member
From: Tampa, Florida
Registered: 11-19-2003
Posts: 384
Website

Re: seo and non-seo urls - duplicate content issue

I have been using this robots.txt file and have had no issues like the ones you describe:

User-agent: Googlebot
Disallow: /index.php?app=

User-agent: Slurp
Disallow: /index.php?app=


User-agent: Msnbot
Disallow: /index.php?app=


User-agent: Teoma
Disallow: /index.php?app=

User-agent: *
Disallow: /index.php?app=


CCP 5.1
CCP 5.1
CCP 5.1

Offline

 

#4 07-14-2008 05:43:33

wyattea
Member
Registered: 01-07-2006
Posts: 1650

Re: seo and non-seo urls - duplicate content issue

So this should be fine and not block robots from visiting my pages:

# Exclude Files From All Robots:

User-agent: *
Disallow: /ccp0-emailfriend/

User-agent: Googlebot
Disallow: /index.php?app=

User-agent: Slurp
Disallow: /index.php?app=


User-agent: Msnbot
Disallow: /index.php?app=


User-agent: Teoma
Disallow: /index.php?app=

User-agent: *
Disallow: /index.php?app=


# End robots.txt file

My PARANOIA is that I add these and (because I've never used a robots.txt file) I screw it up somehow, disallow them from visiting my REGULAR pages, and don't discover until months later that I'm no longer in the search engines.  Does it look OK?
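For anyone with the same worry, a rules file like this can be sanity-checked offline before it is uploaded. The following is a minimal sketch using Python's standard urllib.robotparser module, not anything that ships with CCP. The host name www.example.com is a placeholder, only the Googlebot and catch-all groups from the file above are reproduced, and Python's parser is only an approximation of how any particular search engine reads the file.

```python
from urllib.robotparser import RobotFileParser

# Two of the groups from the robots.txt posted above.
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow: /index.php?app=

User-agent: *
Disallow: /index.php?app=
"""

SITE = "http://www.example.com"  # placeholder host, not a real store


def load_rules(text):
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(ROBOTS_TXT)

# URL shapes taken from this thread.
seo_url = SITE + "/gift/baby-gift-basket-little-boy-blue-0011500.php"
category_url = SITE + "/category/christmas-gift-baskets.php"
home_url = SITE + "/index.php"
non_seo_url = (SITE + "/index.php?app=ccp0&ns=prodshow"
               "&ref=baby-gift-basket-little-boy-blue-0011500")

for url in (seo_url, category_url, home_url, non_seo_url):
    verdict = "allowed" if rules.can_fetch("Googlebot", url) else "BLOCKED"
    print(verdict, url)
```

If the SEO-style pages come back as allowed and only the /index.php?app= form comes back as blocked, the Disallow prefix is doing what was intended.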

Regards,

James...

Last edited by wyattea (07-14-2008 05:43:51)

Offline

 

#5 07-14-2008 20:37:51

theblade24
Member
From: Tampa, Florida
Registered: 11-19-2003
Posts: 384
Website

Re: seo and non-seo urls - duplicate content issue

You're good with that. I actually had the email-a-friend line as well... but didn't post it because I didn't think that's what you were looking for.


CCP 5.1
CCP 5.1
CCP 5.1

Offline

 

#6 07-14-2008 22:35:42

wyattea
Member
Registered: 01-07-2006
Posts: 1650

Re: seo and non-seo urls - duplicate content issue

This should be the DEFAULT robots.txt when installing CCP6. People shouldn't have to discover for themselves that the emailafriend page creates duplicate content.

Thanks man!

James...

Offline

 

#7 07-27-2008 09:03:53

wyattea
Member
Registered: 01-07-2006
Posts: 1650

Re: seo and non-seo urls - duplicate content issue

I just realized - when a category has more products than can display on one page, it goes to a count=20&offset=20 URL reference... which begins with index.php?app=... so basically those pages won't get indexed with the disallow above.

Any ideas how to get around this? I don't want to change my # of products per page because it'll load slower.  Here's how the URL is written:

"/index.php?app=ccp0&ns=catshow&ref=christmas-gift-baskets&count=20&offset=20"

The category pages look like "/category/christmas-gift-baskets.php".

Is there a way to use mod_rewrite to make pages like the one above look like "/category/christmas-gift-baskets-page-2.php" (or 3 or 4, etc.), or something along those lines? Maybe even "christmas-gift-baskets.php?page=2"?
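The mapping being asked for can be sketched as follows. This is Python rather than mod_rewrite, purely to show the pattern and the arithmetic involved. The -page-N URL shape is the hypothetical one proposed above, and the assumption that page N corresponds to offset (N - 1) * 20 is inferred from the count=20&offset=20 example, not confirmed from CCP's code.

```python
import re

PRODUCTS_PER_PAGE = 20  # matches count=20 in the URL quoted above

# Hypothetical friendly pattern: /category/<ref>-page-<n>.php
PAGE_URL = re.compile(
    r"^/category/(?P<ref>[a-z0-9-]+?)-page-(?P<page>\d+)\.php$"
)


def to_internal_url(path):
    """Map a hypothetical paged category URL to the query-string form.

    Returns None when the path is not a paged category URL.
    """
    match = PAGE_URL.match(path)
    if match is None:
        return None
    page = int(match.group("page"))
    if page < 1:
        return None
    # Assumption: page 1 starts at offset 0, page 2 at offset 20, and so on.
    offset = (page - 1) * PRODUCTS_PER_PAGE
    return (
        "/index.php?app=ccp0&ns=catshow"
        "&ref={ref}&count={count}&offset={offset}"
    ).format(ref=match.group("ref"), count=PRODUCTS_PER_PAGE, offset=offset)


print(to_internal_url("/category/christmas-gift-baskets-page-2.php"))
```

An actual rewrite rule would need the same two pieces: a pattern that captures the category reference and page number, and some way of turning the page number into an offset.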

Regards,

James...

Offline

 

#8 07-27-2008 15:33:55

theblade24
Member
From: Tampa, Florida
Registered: 11-19-2003
Posts: 384
Website

Re: seo and non-seo urls - duplicate content issue

It means they won't be indexed, but they should still be followed... which will then make all the products on the second page available to be indexed.


CCP 5.1
CCP 5.1
CCP 5.1

Offline

 

#9 11-11-2008 18:05:22

Blitzen
Member
From: USA
Registered: 01-01-2005
Posts: 936

Re: seo and non-seo urls - duplicate content issue

1. The duplicated page will be demoted, not the entire site.
The original/first page will be just fine.

2. You can avoid duplicate penalty by restricting access to your cgi-bin or wherever your "private" scripts reside. This is similar to theblade24's suggestion.
Restricting access to cgi-bin will also restrict access to carts and the shopper's cart won't be indexed.

The only thing I'd be concerned with is if the SE gets to a page with the long URLs, it'll index that first and consider your nice URL pages as the duplicate.

Last edited by Blitzen (11-11-2008 20:31:10)

Offline

 

#10 09-25-2010 08:32:51

dukes
Member
From: London
Registered: 09-18-2003
Posts: 97
Website

Re: seo and non-seo urls - duplicate content issue

Hi All
Can people here confirm that adding the code below to the robots file will do as suggested in the posts above, and not end up with many URLs not being indexed when they should be? Can anyone who is using this in their robots file get back to me? Any other advice on duplicate content issues would be most welcome too.  I am using the UK version of CCP6.0 with the marketing module and SEO switched on. I know this is an old thread, but if wyattea could respond to the above question that would be great.

Thanks for your help in advance.

regards
Dukes
wwwconcept-zen.com



User-agent: *
Disallow: /ccp0-emailfriend/

User-agent: Googlebot
Disallow: /index.php?app=

User-agent: Slurp
Disallow: /index.php?app=


User-agent: Msnbot
Disallow: /index.php?app=


User-agent: Teoma
Disallow: /index.php?app=

User-agent: *
Disallow: /index.php?app=


regards
dukes

Offline

 

#11 09-25-2010 21:24:08

KryptoJim
Member
Registered: 07-08-2010
Posts: 455

Re: seo and non-seo urls - duplicate content issue

If you look at the logic in CORE_Display.php you'll see how CCP handles this.

Offline

 

#12 09-27-2010 05:10:50

dukes
Member
From: London
Registered: 09-18-2003
Posts: 97
Website

Re: seo and non-seo urls - duplicate content issue

Hi Kryptsupport
I'm not an expert in PHP, so I would need some assistance in decoding that code. I can see where it does some URL building in a section of the file.
I don't really have the time to learn PHP right now, although I intend to soon. However, could you give me a yes/no answer to the previously posted question: can I use that section of the robots file to block the URLs which would cause duplicate content issues with Google's indexing?

Thanks for your help.

regards
Dukes


regards
dukes

Offline

 

#13 09-27-2010 05:50:51

Dave
Member
Registered: 07-05-2003
Posts: 11233

Re: seo and non-seo urls - duplicate content issue

wyattea is long gone, but what he posted should work, though I'll point out that it should not matter in the end. With the User-agent: * line I don't think you also need the individual agent lines such as User-agent: Teoma. Not all bots respect a robots file anyway.
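The point about the catch-all line can be illustrated with a short check. This is a sketch using Python's standard urllib.robotparser module, which only approximates what each search engine does. The single group below is a simplified variant, not the exact file posted earlier: it puts both Disallow lines under one User-agent: * entry so that nothing depends on how a given parser treats repeated * groups.

```python
from urllib.robotparser import RobotFileParser

# One catch-all group carrying both rules discussed in this thread.
SINGLE_GROUP = """\
User-agent: *
Disallow: /index.php?app=
Disallow: /ccp0-emailfriend/
"""

parser = RobotFileParser()
parser.parse(SINGLE_GROUP.splitlines())

# Crawler names from the longer file posted earlier.
BOTS = ["Googlebot", "Slurp", "Msnbot", "Teoma"]
NON_SEO = "/index.php?app=ccp0&ns=catshow&ref=christmas-gift-baskets"

# Map each crawler name to whether the non-SEO URL is blocked for it.
blocked = {bot: not parser.can_fetch(bot, NON_SEO) for bot in BOTS}

for bot, is_blocked in blocked.items():
    print(bot, "blocked" if is_blocked else "allowed")
```

With no named groups in the file, every crawler name falls through to the * group, so the per-bot entries add nothing in this case.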

Offline

 

#14 09-27-2010 06:42:46

dukes
Member
From: London
Registered: 09-18-2003
Posts: 97
Website

Re: seo and non-seo urls - duplicate content issue

Hi Dave
Thanks for the reply. So, to finally clarify: if I add the following to the robots file, should that be enough to solve my duplicate content issues with the NON-SEO and SEO URLs?


User-agent: *
Disallow: /index.php?app=

User-agent: *
Disallow: /ccp0-emailfriend/



regards
dukes


regards
dukes

Offline

 

#15 09-27-2010 06:48:54

Dave
Member
Registered: 07-05-2003
Posts: 11233

Re: seo and non-seo urls - duplicate content issue

Should be. If you have SEO enabled you shouldn't be having any duplicate content issues. If your site got indexed before you enabled SEO simply tell Google to remove the non-SEO format links from their index if you really believe it makes a difference.

I have seen/heard a lot of talk about "duplicate content issues" but have never seen anything definitive that says it is really a problem or issue. I do know it is one of the "selling points" SEO companies tout.

Offline

 

#16 09-27-2010 08:28:51

dukes
Member
From: London
Registered: 09-18-2003
Posts: 97
Website

Re: seo and non-seo urls - duplicate content issue

Hi
OK, thanks Dave, I've implemented that change.  The murky world of SEO.
regards
dukes


regards
dukes

Offline

 

#17 09-27-2010 10:30:33

Design_Wholesale
Member
From: England!
Registered: 11-21-2008
Posts: 1104
Website

Re: seo and non-seo urls - duplicate content issue

I would treat Google reports with caution, as I get many warnings relating to things that are not really a problem at all, image size being a common one, even with correct implementation of Apache mod_deflate.

The other big issue with Google at the moment is site speed, yet their own tool for analyzing the performance of your website comes with a warning to the effect that the speed checks are all based on estimates ...even though Google is now penalizing slow sites in terms of ranking and visibility.

Finally, if you have a sitemap, this is something else that may help if you don't want to waste a load of time trying to hunt down each search engine's site submission process:

Code:

SITEMAP: http://litez.co.uk/sitemap.xml

- just add it to the robots.txt file (modified according to your own site address, of course).
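To confirm that a sitemap line like this is picked up, the file can be parsed locally. This is a small sketch with Python's standard urllib.robotparser module (the site_maps() method needs Python 3.8 or later). The Disallow rule is borrowed from earlier in this thread purely as filler, and the sitemap address is the one from the example above.

```python
from urllib.robotparser import RobotFileParser

# A robots.txt combining a rule from this thread with the sitemap line.
ROBOTS_WITH_SITEMAP = """\
User-agent: *
Disallow: /index.php?app=

SITEMAP: http://litez.co.uk/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_WITH_SITEMAP.splitlines())

# site_maps() returns a list of sitemap URLs, or None if there are none.
sitemaps = parser.site_maps()
print(sitemaps)
```

The sitemap line sits outside any User-agent group, so it does not matter where in the file it is placed.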

Last edited by Design_Wholesale (09-27-2010 10:31:44)

Offline

 
