[thelist] Google release code analysis of a lot of web sites

Edwin Martin edwin at bitstorm.org
Thu Jan 26 07:20:10 CST 2006


Peter-Paul Koch wrote:
>> Interesting bit, that:
>>
>> "In December 2005 we did an analysis of a sample of slightly over a
>> billion documents, extracting information about popular class names,
>> elements, attributes, and related metadata. The results we found are
>> available below. We hope this is of use!"
>>
>> http://code.google.com/webstats/2005-12/elements.html
>>     
>
> Unfortunately the actual data is totally inaccessible since it's
> hidden in SVG graphics, one of the least supported Internet formats.
>
> An excellent example of how not to publish your data.
>   
With this mindset the World Wide Web would have never taken off.

"http? html? Why can't they just put a Word or PDF-document on an 
ftp-site?".

To give a new format a chance, somebody has to start using it.

Edwin Martin


-- 
http://www.bitstorm.org/




More information about the thelist mailing list