
Wednesday, January 28, 2004



Taken out from one of your links:

15: "have to use table instead of CSS positioning or else Netscape4 completely screws it up "

Netscape 4? C'mon Nick, why are you still coding for such prehistoric browsers? It is one thing to code for a vast audience (Mozilla, Opera, NS6/7, IE5/6, Konqueror, etc.) and another to waste your time on a browser that is 5+ years old (it was released for Windows 3.1!!! **yeeks**). Just my 2 cents.

I agree totally. Many people seem to think that developing for standards-compliance is an end in itself, and lose sight of why they want standards-compliance or what they gain from it.

Too bad for Atomz. Other search engines let you configure the tags that mark non-indexing areas.
They could at least use something like HTML comments:

<!-- noindex --> ... <!-- /noindex -->
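
As a rough illustration of what comment-style markers would let an indexer do, here is a minimal Python sketch that drops marked regions before indexing. The marker spelling and the function name `strip_noindex` are assumptions made for illustration; nothing here describes how Atomz or any other engine actually works.

```python
import re

# Hypothetical marker syntax: <!-- noindex --> ... <!-- /noindex -->
NOINDEX_RE = re.compile(
    r"<!--\s*noindex\s*-->.*?<!--\s*/noindex\s*-->",
    re.DOTALL | re.IGNORECASE,
)

def strip_noindex(html: str) -> str:
    """Return html with every marked non-indexing region removed."""
    return NOINDEX_RE.sub("", html)

page = (
    "<p>Post body</p>"
    "<!-- noindex --><ul><li>Navigation link</li></ul><!-- /noindex -->"
    "<p>More body</p>"
)
print(strip_noindex(page))
```

Because the markers are ordinary HTML comments, a validator ignores them, so the page stays valid whether or not the search engine understands them.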

TypePad doesn't have its own XHTML-friendly search function? How sad.

I have to agree with you totally. It just does not make sense spending a disproportionate amount of time and energy struggling to get the last tenth of a percent to reach total conformity. The road to perfection is a long and winding one and in order to make it work out one needs to exercise some restraint and an open mind.

And I am wondering why you are using Atomz instead of Google?


I think this is definitely an Atomz issue. Changing <noindex></noindex> to <div class="noindex"></div> should not be much of a problem for their engine. I would ask them to do it, because the reasons for this are obvious.

PS. Removing all tags from the posts completely seems to be an odd design decision from Six Apart. I wonder why they are not just encoding all < and > to &lt; and &gt;? This would make it much easier to post HTML code in the comments.
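
For what it's worth, the encoding approach is only a few lines in most languages. A minimal Python sketch, where the function name `encode_comment` is made up for this example and is not anything Six Apart ships:

```python
import html

def encode_comment(text: str) -> str:
    """Escape markup characters so posted HTML shows up as literal text."""
    # quote=False leaves " and ' alone; &, < and > are still escaped.
    return html.escape(text, quote=False)

print(encode_comment('<div class="noindex">nav</div>'))
```

The ampersand has to be escaped along with the angle brackets, otherwise a comment that already contains text like `&lt;` would be displayed as `<`.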

Surely the solution is to provide a namespace declaration for the noindex element and then your document becomes well formed XML which should validate. You've done it for the RDF elements.

We could use a validation service that will ignore (or merely warn about) the well-formed use of elements and attributes from other namespaces than html:http://www.w3.org/1999/xhtml.

After all - standard behaviour for HTML browsers is to ignore unknown tags and attributes. (But I think we should insist on well-formedness)
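
A small Python sketch of the namespace idea, checking well-formedness only. The namespace URI `http://example.com/ns/search` and the `s:` prefix are placeholders invented for this example, and there is no reason to assume Atomz would recognise a prefixed element. Note that this says nothing about DTD validity: a parser that validates against the XHTML 1.0 Strict DTD would still reject the foreign element.

```python
import xml.etree.ElementTree as ET

XHTML_NS = "http://www.w3.org/1999/xhtml"
SEARCH_NS = "http://example.com/ns/search"  # hypothetical namespace URI

doc = f"""<html xmlns="{XHTML_NS}" xmlns:s="{SEARCH_NS}">
  <head><title>Post</title></head>
  <body>
    <s:noindex><ul><li>Navigation link</li></ul></s:noindex>
    <p>Post body</p>
  </body>
</html>"""

# fromstring raises ParseError if the document is not well-formed XML.
root = ET.fromstring(doc)

def foreign_elements(tree):
    """List tags of elements that are outside the XHTML namespace."""
    prefix = "{" + XHTML_NS + "}"
    return [el.tag for el in tree.iter() if not el.tag.startswith(prefix)]

print(foreign_elements(root))
```

A lenient validation service of the kind described above could use a list like this to warn about foreign elements instead of failing on them.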

I agree with SC. This is the same issue as valid XHTML Strict + target attribute issues.

No one said that XHTML CAN'T include other things, just that those things need to either be in their own proper namespace or in a custom DTD. Broken is broken in this case when there are two other options available; assuming that Atomz would grok the namespace or DTD.

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd" [

I'm no DTD guru, so apply that concept to adding noindex instead.

Clearly I'm biased here but why not just use a Feedster Search for your blog? I mean we index your XML feed already. And that has no navigational links so there aren't any worries about NOINDEX tags.

http://feedster.com/blog/ to build your search.

ScottJ: I use Atomz for my entire site, not just the blog. Feedster could handle my blog, but not the rest of my site (or am I mistaken about that?).

I use Atomz Search on several of my sites. To avoid validation errors on <noindex></noindex>, my PHP scripts emit those tags only when the user agent is "Atomz..."
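
The same trick, sketched in Python rather than PHP. The assumption that the crawler's User-Agent string starts with "Atomz", the sample string "Atomz/1.0", and the function names are all illustrative; check your own server logs for the real value.

```python
def noindex_tags(user_agent: str) -> tuple:
    """Return (open, close) tags: real ones for the crawler, empty otherwise."""
    # Assumption: the crawler's User-Agent string begins with "Atomz".
    if user_agent.startswith("Atomz"):
        return "<noindex>", "</noindex>"
    return "", ""

def render_nav(user_agent: str) -> str:
    """Wrap the navigation block in noindex tags only for the crawler."""
    open_tag, close_tag = noindex_tags(user_agent)
    return f'{open_tag}<ul><li><a href="/">Home</a></li></ul>{close_tag}'

print(render_nav("Atomz/1.0"))    # hypothetical crawler UA: tags emitted
print(render_nav("Mozilla/5.0"))  # any other client: tags omitted
```

Browsers and validators never see the non-standard element this way, at the cost of serving slightly different markup to the crawler.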

But that doesn't solve the problem completely: When Atomz returns accented characters in the search results, it turns my carefully-crafted character entities into single-byte characters that are not valid UTF-8. So the search results page itself won't validate, even with <noindex></noindex> suppressed.

I can understand that Atomz needs to interpret my character entities, so it can find when someone searches for "N," for example, but the search results should include the unmolested entities.
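
To make the failure concrete, here is a minimal Python sketch. The sample word and the guess that the single-byte output is Latin-1 are assumptions for illustration; the point is only that one byte per accented character is not valid UTF-8, while a numeric character reference is plain ASCII and survives in any encoding.

```python
import html

text = html.unescape("Espa&ntilde;a")   # entity decoded to the character ñ
single_byte = text.encode("latin-1")    # one byte (0xF1) for ñ
utf8_bytes = text.encode("utf-8")       # two bytes (0xC3 0xB1) for ñ
# Re-encode non-ASCII characters as numeric character references.
as_entities = text.encode("ascii", "xmlcharrefreplace")

def is_valid_utf8(data: bytes) -> bool:
    """Report whether data decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False

print(is_valid_utf8(single_byte))
print(is_valid_utf8(utf8_bytes))
print(as_entities)
```

Searching can still work on the decoded text; only the output step needs to re-encode, either as real UTF-8 or back to entities.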
