Google Blog Search & Invalid RSS XML
Did you know that there are some Google Blog Search RSS feeds that don’t validate?

It looks like Google’s having some internationalization issues! The specific errors in question are:
Your feed appears to be encoded as “UTF-8″, but your server is reporting “ISO-8859-1″
line 4, column 315: description contains bad characters (18 occurrences)
line 6, column 75: title should not contain HTML (3 occurrences)
line 9, column 41: title contains bad characters (2 occurrences)
| This entry was posted on Saturday, September 24th, 2005 at 8:31 am and is tagged with internationalization issues, google, utf 8, occurrences, line 6, rss xml. You can follow any responses to this entry through the RSS 2.0 feed. You can leave a response, or trackback. |
3 Responses to “Google Blog Search & Invalid RSS XML”
Leave a Reply
[...] Kui vaadata, millised väljad on kummalgi uudistevool kohustuslikud, siis Atom nõuab viimati muudetud (last-updated) aega. Igaüks otsustab ise, kas see tuleb kasuks või kahjuks. RSS otseselt artiklit kokkuvõtvat teksti ei nõua, kuid Atom’il on eraldi kokkuvõte (summary) ja sisu (content), mis on väga heaks tööriistaks näiteks sel juhul, kui postituse sisuks on mõni video Youtube’st. Lisaks ei suuda RSS eristada, kas feed‘is asuva teksti näol on tegemist tavalise tekstiga, HTML’iga või XML’ga. Sellest ka mõned segased veateated vahest. [...]
Spammers suck a lot
I hate spammers! (