The Wayback Machine - http://web.archive.org/web/20041205212205/http://developers.technorati.com:80/wiki/BugReports

> BugReports

duplicate internal links

The cosmos results not only include internal links (links from the blog to itself) but also list them more than once. Example:

<?xml version="1.0" encoding="iso-8859-1"?>
<!-- generator="Technorati API version 0.9 /cosmos" -->
<!DOCTYPE tapi PUBLIC "-//Sifry Consulting//DTD TAPI 0.01//EN" "http://api.technorati.com/dtd/tapi-001.xml">
<tapi version="0.9">
<document>
<result>
  <url>http://www.kung-foo.tv/blog/archives/000854.php</url>
  <inboundblogs>1</inboundblogs>
  <inboundlinks>3</inboundlinks>
  <rankingstart>1</rankingstart>
</result>
<item>
  <weblog>
    <name>chaotic intransient prose bursts</name>
    <url>http://www.kung-foo.tv/blog</url>
    <rssurl>http://www.kung-foo.tv/blog/index-plus.xml</rssurl>
    <inboundblogs>127</inboundblogs>
    <inboundlinks>159</inboundlinks>
    <lastupdate>2004-04-13 08:51:59 GMT</lastupdate>
  </weblog>
  <nearestpermalink></nearestpermalink>
  <excerpt>  Fri, 04-09-2004 static , comments (1)  , links (0) , technorati (0)   the knot that is the web</excerpt>
  <linkcreated>2004-04-13 08:57:20 GMT</linkcreated>
</item>
<item>
  <weblog>
    <name>chaotic intransient prose bursts</name>
    <url>http://www.kung-foo.tv/blog</url>
    <rssurl>http://www.kung-foo.tv/blog/index-plus.xml</rssurl>
    <inboundblogs>127</inboundblogs>
    <inboundlinks>159</inboundlinks>
    <lastupdate>2004-04-13 08:51:59 GMT</lastupdate>
  </weblog>
  <nearestpermalink></nearestpermalink>
  <excerpt>  Fri, 04-09-2004 static , comments (1) , links (0)  , technorati (0)   the knot that is the web</excerpt>
  <linkcreated>2004-04-13 08:57:20 GMT</linkcreated>
</item>
<item>
  <weblog>
    <name>chaotic intransient prose bursts</name>
    <url>http://www.kung-foo.tv/blog</url>
    <rssurl>http://www.kung-foo.tv/blog/index-plus.xml</rssurl>
    <inboundblogs>127</inboundblogs>
    <inboundlinks>159</inboundlinks>
    <lastupdate>2004-04-13 08:51:59 GMT</lastupdate>
  </weblog>
  <nearestpermalink></nearestpermalink>
  <excerpt>  Fri, 04-09-2004 static , comments (1) , links (0) , technorati (0)   the knot that is the web</excerpt>
  <linkcreated>2004-04-13 08:57:20 GMT</linkcreated>
</item>
</document>
</tapi>
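
Until this is fixed on the server side, a client can filter the response itself. The following is only a sketch, not anything the API provides: the function name `unique_external_items` is hypothetical, "duplicate" is assumed to mean the same weblog URL, nearest permalink and `linkcreated` timestamp (the excerpts above differ only in whitespace, so they are ignored), and "internal" is assumed to mean that the linking weblog is on the same host as the queried URL.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse


def unique_external_items(tapi_xml, target_url):
    """Return the <item> elements left after dropping duplicates and same-host links."""
    root = ET.fromstring(tapi_xml)
    target_host = urlparse(target_url).netloc.lower()
    seen = set()
    kept = []
    for item in root.iter("item"):
        blog_url = (item.findtext("weblog/url") or "").strip()
        # Assumed duplicate key: same source blog, permalink and link timestamp.
        key = (
            blog_url,
            (item.findtext("nearestpermalink") or "").strip(),
            (item.findtext("linkcreated") or "").strip(),
        )
        if key in seen:
            continue
        seen.add(key)
        # Assumed definition of "internal": the linking blog shares the target's host.
        if urlparse(blog_url).netloc.lower() == target_host:
            continue
        kept.append(item)
    return kept
```

Run against the response above with the `<url>` from `<result>` as `target_url`, all three items would be dropped, since they are identical under this key and come from the queried host.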

Getting Odd Results from the Beta API

I was playing with JiBot and made a request to http://apibeta.technorati.com/search?query=http%3A//anarkystic.com&start=0&format=xml&key=<key>. It seems to just return the most recent posts from anywhere. I realize this ought to be a cosmos call, but the behavior still seems incorrect.

<?xml version="1.0" encoding="utf-8"?> 
<!-- generator="Technorati API version 1.0 /search" -->
<!DOCTYPE tapi PUBLIC "-//Technorati, Inc.//DTD TAPI 0.01//EN" "http://api.technorati.com/dtd/tapi-001.xml">
<tapi version="1.0">
<document>
<result>
  <query>http://anarkystic.com</query>
  <querycount>19603</querycount>
  <inboundblogs>9592</inboundblogs>
  <querytime>23.671</querytime>
  <rankingstart>1</rankingstart>
</result>
<item>
  <weblog>
    <name>Come on in, the water's warm!</name>
    <url>http://www.livejournal.com/users/allhatnocattle</url>
    <rssurl>http://www.livejournal.com/users/allhatnocattle/data/rss</rssurl>
    <inboundblogs>0</inboundblogs>
    <inboundlinks>0</inboundlinks>
    <lastupdate>2004-05-20 07:32:52 GMT</lastupdate>
  </weblog>
  <title>The Jesus Landing Pad</title>
  <excerpt></excerpt>
  <created>2004-05-19 22:47:25 GMT</created>
</item>
<item>
  <weblog>
    <name>Come on in, the water's warm!</name>
    <url>http://www.livejournal.com/users/allhatnocattle</url>
    <rssurl>http://www.livejournal.com/users/allhatnocattle/data/rss</rssurl>
    <inboundblogs>0</inboundblogs>
    <inboundlinks>0</inboundlinks>
    <lastupdate>2004-05-20 07:32:52 GMT</lastupdate>
  </weblog>
  <title>Andy Kaufman does an Elvis</title>
  <excerpt></excerpt>
  <created>2004-05-19 22:47:25 GMT</created>
</item>
<item>
  <weblog>
    <name>drigo inutilidades</name>
    <url>http://drigoinutilidades.blogspot.com</url>
    <rssurl></rssurl>
    <inboundblogs>0</inboundblogs>
    <inboundlinks>0</inboundlinks>
    <lastupdate>2004-05-20 17:32:44 GMT</lastupdate>
  </weblog>
  <title></title>
  <excerpt></excerpt>
  <created>2004-05-19 22:46:13 GMT</created>
</item>
....
</document>
</tapi> 

[AdriaanTijsseling] You cannot search for URLs with the search query. If you want to find who links to a URL, use the cosmos query. The search query is for words and phrases.
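
A client such as a bot could pick the query type from the input before calling the API. This is a rough sketch with several assumptions: the helper names are hypothetical, the host is the beta host from the request above, the `/cosmos` path is taken from the generator comment in the first report, and the `url` parameter name for cosmos is a guess that should be checked against the API documentation.

```python
from urllib.parse import urlencode, urlparse

# Host taken from the request quoted above; adjust for the production API.
API_BASE = "http://apibeta.technorati.com"


def looks_like_url(text):
    """True if the input parses as an http(s) URL with a host part."""
    parsed = urlparse(text.strip())
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)


def build_request(text, key):
    """Route URLs to cosmos and words or phrases to search."""
    text = text.strip()
    if looks_like_url(text):
        # Assumption: cosmos takes the target in a parameter named "url".
        return API_BASE + "/cosmos?" + urlencode({"url": text, "key": key})
    return API_BASE + "/search?" + urlencode({"query": text, "key": key})
```

With this, `build_request("http://anarkystic.com", key)` would produce a cosmos request instead of the search request that returned unrelated posts.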

Invalid Token causing a "not well-formed" Expat error in Python

In a cosmos query for "snowchyld.org" I get:

<item>
  <weblog>
    <name>Diary of a Madman</name>
    <url>http://sites.bytemagick.net/ramblings</url>
    <rssurl>http://sites.bytemagick.net/ramblings/index.rdf</rssurl>
    <inboundblogs>16</inboundblogs>
    <inboundlinks>17</inboundlinks>
    <lastupdate>2003-09-30 23:04:51 GMT</lastupdate>
  </weblog>
  <nearestpermalink></nearestpermalink>
  <excerpt>dony's blog @ PersianBlog Stör-Signale Hit &amp; Run hebig.com randomWalks Mahaldin's blog @ PersianBlog umigame's shopping Jon's Blog Unbound Spiral DElyMyth - Libri Nuovi, Compra Ahah's blog @ PersianBlog Rahul Sinha schreibakte Mi Opini�n connected selves snowchyld </excerpt>
  <linkcreated>2003-09-30 18:18:48 GMT</linkcreated>
  <linkurl>http://snowchyld.org/</linkurl>
</item>

The non-ASCII character in the excerpt seems to be the cause of the problem. It may simply be an Expat-related error, but AFAIK Expat handles UTF-8 without problems.

[AdriaanTijsseling] I ran a cosmos query on your site and it came back with the correct encoding. I think it may be an Expat error:

  <weblog>
    <name>Diary of a Madman</name>
    <url>http://sites.bytemagick.net/ramblings</url>
    <rssurl>http://sites.bytemagick.net/ramblings/index.rdf</rssurl>
    <inboundblogs>16</inboundblogs>
    <inboundlinks>17</inboundlinks>
    <lastupdate>2003-09-30 23:04:51 GMT</lastupdate>
  </weblog>
  <nearestpermalink></nearestpermalink>
  <excerpt>dony's blog @ PersianBlog St�r-Signale Hit &amp; Run hebig.com randomWalks Mahaldin's blog @ PersianBlog umigame's shopping Jon's Blog Unbound Spiral DElyMyth - Libri Nuovi, Compra Ahah's blog @ PersianBlog Rahul Sinha schreibakte Mi Opini?n connected selves snowchyld </excerpt>
  <linkcreated>2003-09-30 18:18:48 GMT</linkcreated>
  <linkurl>http://snowchyld.org/</linkurl>
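
One possible explanation, which nobody in this thread has confirmed, is that the response declares UTF-8 but the excerpt contains ISO-8859-1 bytes; Expat reports exactly this kind of input as "not well-formed (invalid token)". The sketch below uses a hypothetical helper `parse_tapi` that retries after re-encoding the bytes, assuming ISO-8859-1 is the real encoding of the offending text.

```python
import xml.etree.ElementTree as ET


def parse_tapi(raw):
    """Parse response bytes, retrying as ISO-8859-1 if they are not valid UTF-8."""
    try:
        return ET.fromstring(raw)
    except ET.ParseError:
        try:
            raw.decode("utf-8")
        except UnicodeDecodeError:
            # Assumption: the stray bytes are ISO-8859-1. Re-encode them so the
            # payload matches a UTF-8 declaration, then parse again.
            repaired = raw.decode("iso-8859-1").encode("utf-8")
            return ET.fromstring(repaired)
        # The bytes were valid UTF-8, so the error has some other cause.
        raise


# A lone 0xF3 byte (ISO-8859-1 for the accented o) inside a document declared as UTF-8.
bad = b'<?xml version="1.0" encoding="utf-8"?><excerpt>Mi Opini\xf3n</excerpt>'
print(parse_tapi(bad).text)
```

Passing `bad` straight to `ET.fromstring` raises `ParseError`, which is the error class the standard library uses for Expat failures. If the response were valid UTF-8 throughout, the first parse would succeed and the fallback would never run.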

The developers' mailing list archive is linked, but the link returns a 404 Not Found.