advertisement

February 2004 Archives

O´Reilly´s Digital Media Blogs have been expanded and are now located at a new home. To find our new blogs, please visit:
Robert Kaye

AddThis Social Bookmark Button

Related link: http://www.id3.org/id3v2.3.0.txt

I’m certainly not known for my love of the ID3v2 specification — it is a terrible specification that I’ve had the displeasure of implementing.

However, I have finally found one redeeming quality of this specification. In section 4.15, while talking about the APIC tag, it lists the possible types of artwork that can be enumerated as part of the APIC tag:

$00 Other
$01 32×32 pixels ‘file icon’ (PNG only)
$02 Other file icon
$03 Cover (front)
$04 Cover (back)
$05 Leaflet page
$06 Media (e.g. lable side of CD)
$07 Lead artist/lead performer/soloist
       . . .
$11 A bright coloured fish
       . . .
$14 Publisher/Studio logotype

A bright coloured fish, eh?

Got any other cute specification bloopers/easter eggs?

Robert Kaye

AddThis Social Bookmark Button

ETech is now behind us — it’s like Xmas in so many ways. You spend much time looking forward to it and then its over in a flash. Alas.

ETech was a blast and extremely valuable — no question about it. There are so many facets of the conference that cannot be captured in a weblog entry; you have to be there to experience it. Tim Appnel already touched on this a little in his blog entry The End of Near. There are so many cool people having cool coversations that if you tried to capture some of these conversations in their entirety we’d be accused of name dropping and bragging.

I’ll take that risk and try it anyway — I’ll just slightly obfuscate the personas involved:

Coming back from lunch on thursday I started chatting about the whole concept of the gift economy with a member of a vital international standards organization and the CTO of a prominent non-profit, when the inventor of a ground breaking business application butted in and proceeded to lecture us on the semantics of the term gift economy. He stongly advocates that people use a more suited term — he took objection to term gift.

Where else but at ETech can you be rubbing elbows with the people who make high tech happen, only to be schooled by one of the legendary names in computer science??

This year’s ETech presented the usual collection of stimulating keynote speeches, presentations, tutorials and panels. But one subtle thing about the conference was different — the number of female attendees this time was greater than in previous years. The computer science field is chronically unbalanced when it comes to the sexes and the first few ETech conferences were no different. I’m not sure what caused the change, but I was delighted to see a healthier balance of the sexes. Unfortunately, it was still far from balanced, but the overall trend seems to be improving — at least at ETech.

I had a blast in San Diego — the hotel was a good venue for the conference, even if the hotel staff seemed befuddled at the attendees at times. (I observed a hotel staffer looking dumbstruck at one attendeee who was holding two open laptops in his hands and still wildly gesticulating as he spoke).

Thanks to Rael, Gina and the whole O’Reilly Conferences staff — you all rock!

If you made it to ETech this year, what were your thoughts??

Jon Lebkowsky

AddThis Social Bookmark Button

Related link: http://www.iraqwarreader.com/archives/000133.html

I added a comment to Micah Sifry’s blog item about O’Reilly’s Digital Democracy Teach-In, showing where I think he presumed too much.

Here’s an extract:

The reason some of us focus on technology is NOT that we somehow believe technology is the great panacea and healer of all political wounds. The reason is that we’re technologists, and this is what we do. You were *at a technology conference*, so you could naturally expect that focus, no? To imply that we are somehow dewy-eyed and utopian because we focus on what we do is rather insulting, but worse, it misses the point.

Will we effect change overnight? Certainly not. Many of us compare what we’re doing to the work the Republican right did over the last couple of decades. They built think tanks, cultivated leaders, and organized through churches. Essentially they were building through their social networks, which is what we also hope to do, using social technology.

Follow the link to read Micah’s thoughts and my complete comment.

Technology is clearly transforming some aspects of politics, but is its impact superficial? Or can we expect significant social changes driven by our uses of relevant technologies?

Jon Lebkowsky

AddThis Social Bookmark Button

Related link: http://www.itconversations.com/detail.php?id=77

At O’Reilly’s Digital Democracy Teach-In, Wes Boyd of MoveOn gave the five-year history of the organization, which formed in response to the call for Bill Clinton’s impeachment hearings and has evolved into a significant political force. Boyd said that MoveOn wants to be about democracy, but is forced to play an attack-and-defend game because that’s the kind of narrative favored by broadcast media. Attack-and-defend breeds cynicism and undermines civic engagement, but it’s a form of entertainment, and as such, it sells advertising.

The Internet is a different kind of media environment than broadcast, pluralistic and highly interactive, a potential platform for civil discussion and debate where, says Boyd, “you can win by saying things that are sensible and moderate, not crazy.” This is what MoveOn has tried to do.

Boyd says there may be an emerging public mind that is smarter than any of us as individuals, its nature not yet completely clear. “We’re all actors in making it happen,” he says. He’d like to get people who are not in agreement talking to each other. He’d like to see more empirical work to figure out what’s really going on with people.


For the moment MoveOn is working to enourage replication, partnering with other advocacy organizations.

My own thought is that an organization like MoveOn could become a democratic platform (technically, socially) for various kinds of progressive advocacy, a framework in which MoveOn, its partners and engaged citizens can work to stimulate participation in political process.

Brainstorm: what online tools are most effective in catalyzing political action? What political tools would you like to see in the future?

Rick Jelliffe

AddThis Social Bookmark Button

Related link: http://www.internetnews.com/dev-news/article.php/3312091

The technology behind
Microsoft’s patents and applications related to
"http://www.computerworld.co.nz/news.nsf/0/49FD37FEB9B07814CC256E260077E78C?OpenDocument"
>word processor
documents in a single XML file and multiple
>scripts in a single XML file is fascinating.

Their wonderful
Extensible Markup Language Patent Machine
can generate a customized patent application
for anything that can be collected into a single XML file,
and “vocabulary miners” are systematically working
through every known class of data:
spreadsheets, config files, media, graphics,
financial data, patches, virues etc., submitting the auto-generated patent applications
to the scrupulous attention of the Royal
>Tongan Patent Office.

On and on, the machine
diligently chugs through the whole dictionary.
Fortunately,
"http://www.usatoday.com/money/companies/regulation/2004-01-13-patentscover_x.htm"
>patent offices
have their own
patented "http://www.aboutrubberstamps.com/rubber-stamping-history.html"
>rubber-stamping
machines to cope.

A personal computer on every desktop and an XML
patent on every noun
” is the company’s new
slogan, according
to information recovered from a leaked company word processing file on a different topic.
“Innovative research has never been easier!”, claimed a
spokesman from the
"http://research.microsoft.com/nlp/projects/mtproj.aspx"
>Machine Translation
project.

But in a cruel
"http://www.jaffebros.com/lee/gulliver/moonlight/plate45.jpeg"
>twist
of fate, it seems that this
there is already some
>prior art which would prevent the
XML Patent Machine itself
from being patented.
And Microsoft might find a challenge from
Apple, who are assembling iPods into a
SETI-like supercomputer to generate
patents for everything using two files.
Apple spokeman, Kevin Spline spins this as
“You can make a better user interface with
two files, because humans have two eyes!
Its just basic math.”

Open Source advocates have charged that
this is just an attempt to force everyone
to adopt Microsoft’s database-based file
systems by making files unworkable.
“No, no, that is our strategy for email”
a Microsoft representative said, speaking on
terms of strict fictitiousness.

In a related development, leaders from developing
countries applauded the new patent:
a source close to one leader said
“This will stifle the development of
a vibrant indigenous software sector,
depress an imaginative middle class and
promote brain-drain. We don’t need those
kind of trouble-makers.”
An interesting analysis from
"http://www.economist.com/displayStory.cfm?Story_ID=300582"
>The Economist

magazine claims that the more
enlightened leaders of newly industrialized
nations are worried that
unless the developed nations can keep up their
rate of monopolistic appropriation
aided by various Free Trade negotiations,
the West may not have any
money to buy anything
and will need pocket money.

Is “I saw it first!” really the way to organize a peaceful world?

Robert Kaye

AddThis Social Bookmark Button

Related link: http://www.foaf-project.org

As it often happens, I have a basic understanding of a technology before I show up at ETech and then I attend a talk and my whole world changes. Once you see the enthusiasm and wild gesticulations of the speaker(s) you catch their enthusiasm and the deeper meaning and context of the technology becomes clear.

Listening to Dan Brickley and Edd Dumbill’s talk about FOAF really drove home the key points and the greater context behind FOAF. FOAF stands for Friend Of A Friend and is a distributed approach to social software. FOAF uses XML/RDF to express information about a person:

I am Dan and I work at W3C and I know Libby who works at ILRT and her FOAF profile is [over here] and here are the titles and descriptions we wrote together.

FOAF is essentially a machine readable version of a person’s home page. Interpreting and understanding an HTML based homepage for a person is still too complicated a task for computers. FOAF tackles this by expressing relationships and information about yourself in XML/RDF. Given this machine readable format, FOAF lets you describe anything about yourself or anyone else.

Unlike centralized social software sites like Friendster, Tribe.net and Orkut, FAOF takes a completely decentralized approach. FOAF profiles are normal XML text files that are stored on your website — just as normal home pages are. This means that the data about yourself belongs to you and not to the social software site. Orkut’s terms of service state that the data about you is not owned by you — it is owned by Orkut (Google). This is a fundamentally bad idea — my data should be owned by no one but myself.

FOAF’s distributed nature gives you the power to put the data back in your control, and on a server/website that is controlled by you. This freedom comes at a cost that is typical when centralized services become distributed. Authentication is just one of the difficult problems to solve — how do you know that a given FOAF profile really does belong to the person who claims it? Centralized services can easily solve this problem by authenticating users and restricting the user to edit only their own data.

The top-down approach used by central server based social software systems will ultimately limit the scope of what can be done with these systems. The decentralized bottom up approach of the web that enables anyone to do anything has proven to be a great approach. If the web was controlled by any one entitiy (benevolent or evil) it would not have grown at the amazing rate that we’ve witnessed.

I think FOAF is the right approach for the future, but many difficult problems remain — how will FOAF handle lying, trust, reputation? I’m hoping that we can solve these difficult problems soon — if nothing else, FOAF will expose these problems to the greater public which will get more brains thinking about the problem. And that’s certainly a step in the right direction.

Do you have a FOAF profile? What are your experiences with it so far?

Rick Jelliffe

AddThis Social Bookmark Button

Charityware is a kind of shareware, where
the developer asks the customer to donate something
to charity. The >VIM text editor is a good example of this.

That is lovely for private developers wanting
to use their hobby for good and merit, but there is
another kind of charityware,
(anyone know the name for this?),
that individual and corporate
commercial application developers can
consider.

Why not license your application free to
charities? With electronic distribution it
is almost no expense, it increases your user base
and goodwill at the initial stages of product
release, it probably does not lose full-price customers,
it makes you and your team happy,
and, more than anything else, it is good
for the world.

My company has been doing this
for a while now, and I recommend it.
We use an adjusted version of our evaluation
license, forbidding re-distribution or
commercial use and not promising support.
We also keep track of license keys, so we
know the source of pirated versions.
People email us with details, we vet them
quickly, with a polite “sorry” if they not
really charities (not all not-for-profit
organizations are charities!).
We have been picked up by charities in five
continents.

In my experience, the key thing is making
sure the charitable licenses are almost zero
cost to administer: we take about five minutes
per license and we get a handful of requests
per month. Its not a big deal for us, but
you never know, it could be pivotal for some
charitable project.

I guess this mostly applies to shrink-wrapped,
end-user applications. I suspect a different
kind of license would be most useful for
commercial libraries: one that says that the
library can be supplied free of charge for use
in products supplied free of charge to registered
charities for their exclusive, non-commercial
use but not re-distribution.

Is there a name for this kind of license?

Robert Kaye

AddThis Social Bookmark Button

Related link: http://www.technorati.com/

ETech 2004 is in full swing and my talk is behind me — time to do some serious blogging!

My favorite session of the day covered the basics and the newest glitzy features in Technorati. David Sifry explained how Technorati aggregates trackback pings from 100,000 weblogs daily and creates a link cosmos for a given URL. The link cosmos tells webloggers how many other weblogs have linked to a particular weblog entry.

At the most basic level Technorati lets webloggers know how well connected their weblog is — from this you can infer the basic picture of how healthy and effective your weblog is. On a macroscopic level it shows the overall health of the blogosphere as well. 100,000 updated weglogs per day and a new weblog being created every 7.8 second shows us that weblogging is alive and well. With so many weblogs being created, it begs the question how many of these are abandoned — Technorati has an answer for us: 35% of all weblogs are dead, where a weblog is considered dead if it hasn’t been updated in 3 months.

So far, that’s all interesting, but pretty simple stuff. The really interesting aspects of Technorati come into play when you consider the vast amount of data that has been accumulated and what can be done by analyzing and mining this data. Sifry pondered if there was any substance to Clay’s claim about Powerlaws and weblogs and concluded: “F*ck it, I have the data!”.

Sure enough, Kevin Marks had plotted the data collected from Technorati and also came up with this cool log-log graph which proves that there is a powerlaw in weblogs. Not surprising — I bought into Clay’s argument a while ago — but it was really nice that a graph showed the reality of what is actually happening.

I had visited Technorati before, but I never really grokked the deep implications of their work. Technorati has a clear grasp of the health of the blogosphere and a vast amount of data from which to draw more conclusions. With very little hacking Sifry managed to create Top products discussed in the last 24 hours.

Technorati has all sorts of plans for adding more features and extending their REST based API — I’m curious to see what Sifry & Co will come up with next. I’m glad to have learned about this cool resource — thanks for the great talk Mr Sifry!

What do you think of Technorati?

Mark Sigal

AddThis Social Bookmark Button



I have recently started devoting disciplined energy towards
becoming financially literate.  You may wonder, "What does financial
literacy have to do with the O’Reilly audience?"  Well,
after spending some time playing around with the different financial web
sites and reading a really interesting book, I started to contemplate a new web
site (destination) and web service (APIs). The book I just read is
called, “ href="http://www.amazon.com:80/exec/obidos/ASIN/0971695830/verdada-20?creative=125581&camp=2321&link_code=as1"
target="_top">The Vital Few vs. The Trivial Many,” by George Muzea, and
what is interesting about it is that it introduces a few meaningful
concepts that I hadn’t previously contemplated. 

 

One is a market fundamental that sixty percent of all stock
price movements are related to the overall trend of the market.  This
suggests, somewhat intuitively, that your ability to make money in the stock
market is in direct proportion to your ability to invest only when the
percentages are with you.  It just so happens that there is a narrow
window of time that the market percentages are cyclically unfavorable, allowing
you to buy in at favorable prices, and a narrow window of time that the market
percentages are cyclically favorable, allowing you an ideal sell point (the
proverbial buy low, sell high).  Since search patterns are well formed for
ferreting out such opportunities, but the requisite publicly available data
sources (that must be parsed) are dispersed over the Internet, this is an ideal
application for “information management” automation.

 

In a nutshell, here’s how it works.  You buy beaten
down stocks in mid-October and sell them all in mid-February the following
year, making this a four-month strategy (I wish I was writing this column in
October, but that’s another story).  The logic is that the market is
inherently down-pressured by the institutional tax loss selling that must be
completed by October 31 as mandated by the IRS (as opposed to tax loss selling
for individuals, which can be completed until December 31), and up-pressured by
the new money that flows into IRAs and 401Ks at the beginning of a new
year. 

 

So how do you automate this?  Step one:
start with the NYSE "New Low List" (in href="http://www.barrons.com/">Barron’s weekly and href="http://online.wsj.com/public/us">Wall Street Journal daily), which
essentially is a list of companies hitting 52-week lows. Step two:
subtract liabilities and long-term debt from assets on an identified company’s
balance sheet, removing entries that are a negative number (this data can be
accessed at CBS Marketwatch).  Step
three
: from the remaining list, search the ten-year chart of each
remaining stock (also available at CBS Marketwatch).
If the current stock price of a given stock in the list is not in the lower
third of its ten-year price history, remove it from the list. Step four:
from the remaining list, filter on institutional ownership, where the
institutional ownership value equals 30% or greater (data that is available at href="http://moneycentral.msn.com/investor/invsub/ownership/ownership.asp">MSN
Money). Step five: filter out remaining entries that have not
at least shown a penny of profit in the last quarter’s earnings (also available
at CBS Marketwatch).  The
interesting thing, and part of the reason automation is compelling for this
exercise is that from a list of 100 or so "New Lows," maybe 10 will
remain after completing this process, and these stocks will have both strong
balance sheets and current earnings, be in the bottom range of their 10 year
trading history, and down pressured by the institutional tax loss selling that
must be completed by October 31st.  Once the cyclical down pressure is
replaced by the cyclical up-pressure on January 1, these stocks should rise.
 The author of the book suggests that you determine your total investment
and buy equal dollars (not shares) of each of the quality stocks and
then sell them all by mid-February. The author has used this strategy
successfully in 24 out of 25 years. 

 

Similarly, the author separates the market into what he
calls The Trivial Many, the mass investor market and so-called experts,
who you effectively want to bet against when investing, and The Vital Few, corporate
insiders, such as officers, board members and major shareholders, whose buying
and selling actions you should track like a hawk.  Generally speaking,
insiders sell into price strength and buy into price weakness (again, buy low,
sell high). You want to look for new stock purchases where insiders are buying
as the stock goes up or when insiders are buying a depressed stock, and you
want to see them increasing their ownership percentages by at least 30 percent
of their holdings in the company. Also, since it is normal for insiders to buy
as their stock goes down and sell as it goes up, you particularly want to look
for divergence from this normal behavior. For example, your
eyes should be wide when you see an insider, especially the chief financial
officer who normally sells stock only when price rises suddenly break this
pattern by selling into price weakness. It usually means that the company’s
business conditions have deteriorated and that bad news is coming. On the other
hand, you should be really impressed when you see insiders buy at higher prices
than their earlier purchases. This usually means that business conditions are
at least as strong as when they first bought, and in many cases, getting
stronger. Better than expected news will more than likely surface a few months
later. 

 

So how do you take advantage of this one technologically? 
First off, insider actions are required to be registered with the SEC within
two days of their initiation (e.g., an insider sells a block of their stock
holdings), and this data is accessible within a specific database, known as href="http://www.sec.gov/edgar/searchedgar/webusers.htm">EDGAR.  While
there are premium services that show correlations between a given insider’s
action and movement of the stock, the basic data is out there gratis.  The
specific type of filing to search for is known as Form 4, or “Statement of
Changes of Beneficial Ownership.” 

 

As there is a fairly finite amount of insider activity, and
insider buys are generally more predictive than inside sales, database size
should be manageable even over a period of years.  As to why insider
buying is generally more predictive than insider selling, consider that
insiders may sell for any number of non-business related reasons (such as to
buy that vacation home) but will generally only buy if they believe that the
market has under-valued their company’s forward looking business
momentum. 

 

One way of fine tuning such a search query string is to
track all insider buying actions, in terms of adding them to the database, but
set a special flag to alert you when there are follow-on stock purchases by the
same insider in the face of an increasing stock price (i.e., an insider buys
10,000 shares when the stock is at $10, and then purchases a similarly
meaningful amount of shares when the stock hits $13 a few months later.  The
logic here is that follow-on purchases in the face is rising share prices are
heavily predictive.

 

Again, you can tweak the model in terms of what constitutes
a meaningful amount of shares (in terms of number of shares or absolute dollars
relative to the insider’s holdings in the company).  Similarly, you can
track and set your own flags for correlations between frequency of buys,
multiple buyers within the company, timing of purchases and the stock’s price.

  

The key point in all of this is that you can build a web
site that expresses your best stab at providing online answers to different
“what-if” financial questions built around objective models tied to predictive
data that is publicly available.  Further the underlying elements of this
web site can be expressed as a web service, enabling like-minded peers to tweak
the inputs and outputs to their hearts desire.  So, for example, if a
consumer of your web service loves a columnist, like href="http://www.forbes.com/columnists/col_archive.jhtml?aname=Kenneth+L.+Fisher&author=kenneth+and+fisher">Ken
Fisher or Herb
Greenberg
, or an online pub like Motley Fool,
they can build and maintain a portfolio that tracks their favorite columnists’
recommendations and cross-validates them around the insider-buying model. 
Pretty cool, I think.

What type of analysis do you perform on your investments, and how might they complement such a web site or service?