|
|
#1 |
|
4x4
Join Date: Oct 2004
Posts: 1,019
|
Robots.txt Verification
I want to prevent search engines from accessing mysite.com/rss/whatever
and mysite.com/rss.php?blah=whatever I believe this si the correct stuff for robots.txt but wanted to verify with everyone here to be 100% sure. User-Agent: * Disallow: /rss/ Disallow: /rss.php |
|
|
|
|
|
#2 |
|
Site Contributor
Join Date: Feb 2006
Location: Portland Maine
Posts: 1,185
|
Looks good to me.
__________________
Ken Barbalace - EnvironmentalChemistry.com (Environmental Careers, Blog) InternetSAR.org: Volunteers Assisting Search and Rescue via the Internet My Firefox Theme Classic Compact: Based onFirefox's classic theme but uses much less window space |
|
|
|
|
|
#3 |
|
Registered
Join Date: Mar 2006
Posts: 354
|
Google's Sitemaps has a tool in the control panel that tests your robots.txt file for you.
__________________
Max |
|
|
|
|
|
#4 |
|
Senior Member
Join Date: Sep 2005
Location: Pottsville, NSW
Posts: 529
|
Do rss feeds count as duplicate content?
|
|
|
|
|
|
#5 |
|
4x4
Join Date: Oct 2004
Posts: 1,019
|
|
|
|
|
|
|
#6 |
|
Site Contributor
Join Date: Feb 2006
Location: Portland Maine
Posts: 1,185
|
Especially if others use your RSS feeds on their sites. My advice is to only provide short descriptions in RSS feeds, not entire articles.
__________________
Ken Barbalace - EnvironmentalChemistry.com (Environmental Careers, Blog) InternetSAR.org: Volunteers Assisting Search and Rescue via the Internet My Firefox Theme Classic Compact: Based onFirefox's classic theme but uses much less window space |
|
|
|
|
|
#7 |
|
Senior Member
Join Date: Sep 2005
Location: Pottsville, NSW
Posts: 529
|
Thanks - time to edit the robots.txt on a few sites
|
|
|
|
|
|
#8 |
|
Registered
Join Date: Oct 2004
Location: UK
Posts: 264
|
|
|
|
|
|
|
#9 | |
|
4x4
Join Date: Oct 2004
Posts: 1,019
|
Quote:
More information about duplicate content and supplemental results can be found on google blog. |
|
|
|
|
|
|
#10 |
|
Senior Member
Join Date: Sep 2005
Location: Pottsville, NSW
Posts: 529
|
On this subject again, do you have any concrete proof that feeds are classed as duplicate content?
The reason I ask is because I checked the robots.txt on a few authority sites (seobook.com/robots.txt) and they didn't have them blocked. And also watched this wordpress SEO video from Michael Gray and didn't catch a mention of blocking feeds Also if you blocked them - how would that work with Google Blog Search? |
|
|
|
|
|
#11 | |
|
Registered
Join Date: Oct 2004
Location: UK
Posts: 264
|
Quote:
It depends on how you set it up, if done correctly it isn't counted, you just have to look at Google's blog itself. |
|
|
|
|
|
|
#12 |
|
Trench Warfare
Join Date: May 2003
Location: Australia
Posts: 813
|
I do. Google was indexing my rss feed and my page with the same content on it was in the supplement index. Once I stopped the search engines from being able to index the feed (wordpress allows search engines to spider the feeds easily btw), the rss feed dissapeared from the search engines and my page came out of the supplement index.
__________________
|
|
|
|
|
|
#13 |
|
Senior Member
Join Date: Sep 2005
Location: Pottsville, NSW
Posts: 529
|
Thanks ozgression
|
|
|
|
|
|
#14 | |
|
4x4
Join Date: Oct 2004
Posts: 1,019
|
Quote:
|
|
|
|
|
|
|
#15 | |
|
Registered
Join Date: Oct 2004
Location: UK
Posts: 264
|
Quote:
|
|
|
|
|
![]() |
| Bookmarks |
| Thread Tools | |
| Rate This Thread | |
|
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Insecurity of robots.txt ? | Nico | Website Programming & Databases | 8 | 04-24-2007 04:26 AM |
| robots.txt question | webvista | Website Programming & Databases | 2 | 06-12-2006 01:30 AM |
| Robots.txt help please | Blue Cat Buxton | Search Engine Optimization | 2 | 07-20-2005 06:57 AM |
| robots.txt | delpino | Search Engine Optimization | 2 | 01-23-2004 01:13 AM |
| Is Google ignoring my robots.txt file? | flyingpylon | Search Engine Optimization | 9 | 01-13-2004 08:47 AM |