[thelist] Google release code analysis of a lot of web sites
Edwin Martin
edwin at bitstorm.org
Thu Jan 26 07:20:10 CST 2006
Peter-Paul Koch wrote:
>> Interesting bit, that:
>>
>> "In December 2005 we did an analysis of a sample of slightly over a
>> billion documents, extracting information about popular class names,
>> elements, attributes, and related metadata. The results we found are
>> available below. We hope this is of use!"
>>
>> http://code.google.com/webstats/2005-12/elements.html
>>
>
> Unfortunately the actual data is totally inaccessible since it's
> hidden in SVG graphics, one of the least supported Internet formats.
>
> An excellent example of how not to publish your data.
>
With this mindset the World Wide Web would never have taken off.
"HTTP? HTML? Why can't they just put a Word or PDF document on an
FTP site?"
To give a new format a chance, somebody has to start using it.
Edwin Martin
--
http://www.bitstorm.org/