March 2007 Archives

M. David Peterson

AddThis Social Bookmark Button

According to my clock we’re only 52 minutes into the first of April and we already have the clear and uncontested winner…

How TiSP Works

Google TiSP (BETA) is a fully functional, end-to-end system that provides in-home wireless access by connecting your commode-based TiSP wireless router to one of thousands of TiSP Access Nodes via fiber-optic cable strung through your local municipal sewage lines.

And if that wasn’t enough, how about,

DING, DING, DING > We have a winner! ;-) Everyone else: Just give up. You’re not going to win.

Thanks for making me laugh, Google! :D

M. David Peterson

AddThis Social Bookmark Button

OLPC

Touch more Englebart and Kay (and plenty of Berners-Lee) please, hold the Gates and Jobs.

*FANTASTIC* review from Danny Ayers on the OLPC. You should take the time to read it.

So Danny,

Rick Jelliffe

AddThis Social Bookmark Button

Schematron uptake is on the increase, and the betas implementation of ISO Schematron is chugging away. The relevant working group at ISO (ISO/IEC JTC1 SC34 WG1) has asked me to look into preparing an update for the standard; most of the other ISO DSDL family of schema languages have just been through a round of corrections based on initial experience, and I want to prepare something by the end of May.

There won’t be any changes that would break existing ISO Schematron schemas. And I don’t think there would be any extra logical apparatus or changes to the class of logic required; and certainly nothing that would prevent implementation in XSLT 1 by default.

I am interested in gathering a wish list, especially things where you have extended Schematron (i.e. your requirement was strong enough that you actually wrote some code!) Please let me know.

Kurt Cagle

AddThis Social Bookmark Button

Adriaan de Jonge’s article Xforms vs. Ruby on Rails has created quite a stir in both the XML and Ruby communities, and for good reason. He asked a fairly important question - are XForms an also-ran technology that Ruby has managed to supplant?

I’ve been thinking about the article for some time myself. Certainly it’s a difficult question to answer, because to a certain extent I tend to agree with him … on some aspects. XForms had a lot of potential that it hasn’t yet lived up to, does require more than a little bit of advanced computing skills (or at least the right mindset) and suffers the fate of many of the W3C standards, which is to be smashed up against the rocks of browser vendor indifference.

And yet … I’m not quite ready to throw in the towel. Now, don’t get me wrong. I’ve worked with Ruby, and overall I think it is a delightful language for many of the same reasons that Adriaan does - it is more declarative than imperative, it places unit test planning well ahead of coding, it fits in very nicely with the web development paradigm, and it works reasonably well in mapping basic data sources into objects. It’s contributions to the development of JavaScript have been crucial to the success of AJAX, and overall I think that it would not be a bad skill for any web developer to have on his or her resume.

Hari K. Gottipati

AddThis Social Bookmark Button

While Target is facing the lawsuit by NFB on accessibility issues, Amazon realised the importance of accessibility and they are going to make Amazon.com accessible for blind people via screen readers. Lately AJAX accessibility issues caught vendors attention and lot of companies including Bindows focused on implementing the accessibility functionality for Ajax applications. Today Amazon.com and National Federation of the Blind joined the forces to develop and promote web accessibility. From the press release:

The National Federation of the Blind (NFB) and Amazon.com announced today that they have agreed to work together to promote and improve technology that enables blind people to access and use the World Wide Web. In a cooperation agreement, Amazon.com pledged its commitment to continue improving the accessibility of its Web site platform, while the NFB committed to contribute its expertise in Web accessibility technologies to help further Amazon.com’s efforts.

From the NFB and Amazon agreement:

PART 3 - ACCESSIBILITY TIME TABLE
A. Amazon commits to work to provide Full and Equal Access on Amazon.com and Syndicated Store Web Sites, to the extent such access is not already available, by no later than December 31,2007 and continuing thereafter.
B. Amazon commits to work to implement technical measures, to the extent any are necessary, no later than June 30, 2008 and continuing thereafter, so as to ensure that third parties to whom Amazon delivers e-commerce services are not prevented by Amazon-supplied technology from providing Full and Equal Access on their Merchant.com Web Sites.

This is a good sign and I am sure more applications/sites will follow this. Good move by Amazon.

M. David Peterson

AddThis Social Bookmark Button

If you were too late, and were denied access on your first attempt, word on the street is that Brad found ways to distract upper management with “Hot” Krispy Kremes and as such, there’s a chance he can sneak you in the back door during the aforementioned moment of distraction. Let him know *SOON* (bradATbungeelabs.com)

If you already have a spot, or are able to gain one via Brad’s ever so sly “Krispy Kreme method”: See ya there! :)

M. David Peterson

AddThis Social Bookmark Button

Update: As per my follow-up to orcmid,

Oh, I think the marketplace is a *GREAT* idea, and in fact is *WELL* overdue. They should have been doing this all along! I was just laughing at Oleg’s follow-up phrasing of one (of many!) ways you could utilize OSS to your financial benefit. In fact, I almost titled the post “On SourceForge and Open Source Obfuscation”, (and probably should have now that I think of it), but chose not to for some odd reason.

I was in a hurry, and didn’t extend things as I normally will, so my apologies to those of you left with the impression that I thought the SourceForge Marketplace was in any way a bad thing.

[Original Post]
Signs on the Sand: SourceForge Marketplace

Sounds interesting. Another way to get rich - create great open source product, make your code unreadable, provide no documentation and then sell support :)

Michael Day

AddThis Social Bookmark Button

[Update: see below]. A few years ago, Eric van der Vlist put together a proof of concept XML schema language called Examplotron. The clever part of Examplotron is that the schema for a given XML document is that document itself; a document is its own schema. This allows schemas to be designed by writing down example documents (examplotron, get it?) which can then be generalised automatically to produce a RELAX NG schema for those documents and other documents like them. Clever. Now, what if XPath worked like that?

Andrew Savikas

AddThis Social Bookmark Button

Dear XML Mind,

I was disheartened to learn that you’ve chosen to make the free version of your product even less useful to end users. This appears to be an acceleration of a recent (and alarming) trend in your release cycle of removing features for non-paying customrs.

While I can certainly understand the desire to convert more customers to paying ones, I suspect that what instead will happen is that people will abandon the product entirely, rather than risk running afoul of the license.

Here at O’Reilly, your XML Editor has helped to bring an XML workflow to many more mainstream users (in authoring and production of manuscripts) than any other tool available. Among the biggest selling points for authors has been their ability to dowload and use a fully functional version of the product for free (making it a viable alternative to OpenOffice, to our advantage). It’s a very low-risk way to test the waters.

As we develop and expand our XML workflow, we’ve purchased a half dozen licenses for the professional version of XML Editor, and have had several discussions about purchasing an Enterprise-level license within the next 6-18 months (partially pending the addition of a revision-tracking feature). However, this latest announcement means that we’ll need to seriously reconsider that — most of our authors are not on site, and are not employed by us, so would not be covered by an Enterprise license. And it is not economical for us to purchase individual licenses for those authors (200+ per year).

Again, I can certainly appreciate your desire to convert more free users into paid ones, and I of course don’t know the specifics of your business model or situation. But I *strongly* encourage you to reconsider your decision on the Personal Edition license. While it may well be the case that some of your users who are using the Standard Edition are getting something “for free” that they would otherwise pay for, I suspect that the vast majority will not convert to paid customers, and the overall user base for XML Editor (and hence potential market) will shrink. As with the case of music and software piracy, you may soon find that such restrictive measure have the opposite effect of what you’re intending (see Piracy is Progressive Taxation)

Regards,

Andrew Savikas
Director, Digital Content and Publishing Services
O’Reilly Media, Inc.

Rick Jelliffe

AddThis Social Bookmark Button

A few people (this means you Trung!) have emailed me recently about more information in Schematron. Good news: Eric van der Vlist has just had a little PDFbook, in O’Reilly’s Shortcuts series, on Schematron

It appears to feature Gollum on the cover, according Christophe Lauret :-)

Eric gave me the chance to comment on a draft (I like to do technical editing on one or two books a year, to keep my hand in), so I will be interested to see how it shaped up. I think Eric tackles pretty well that we are currently on a cusp, with some people using Schematron 1.5 and some moving on to ISO Schematron.

M. David Peterson

AddThis Social Bookmark Button

In regards to the title, as anyone who writes code for a living understands, things don’t always turn out the way you plan. ‘nuf said. ;-)

So this is really more of a status update in regards to the ongoing Atom Publishing Protocol theme that has pretty much dominated much of my focus, both in blogging as well as code, for the last couple of weeks months. That said, there is one open source project that has really caught my eye as of late that I want to bring to your attention, and will do just that in wrap up at the end of this post.

As alluded to above, I’ve been heads down for the last who knows how many days/weeks/now months (I don’t keep track <- obviously! ;-) with a fairly APP-focused frame of mind. Though it wouldn’t and won’t be obvious what I mean by this until it is (<- and thats pretty much all I can say on the matter (for the moment, anyway)), my professional development focus is becoming increasingly honed into finishing some fine-tuned details of several Vibe*-related projects, one of which we nearly launched recently, then decided to hold back to place attention on some detail work, much of which is directly related to the writing I’ve been doing on nuXleus:Xameleon (Amplee, AtomicXML, LiveClipboard, ModuleT, nuXleus, and so forth). I’m excited by all of this in ways that I can not describe. I hope when you see the result, you too will feel the same level of excitement. More on that when I am able.

In the mean time,

M. David Peterson

AddThis Social Bookmark Button

Update: via http://www.rpath.com/rbuilder/tryItNow?id=1,

This appliance can be run in the Amazon Elastic Compute Cloud (EC2), compliments of rPath. Click on the button below to launch the appliance. Once the boot process is complete, additional instructions will appear and you can complete the installation. Then, use the MediaWiki appliance in the cloud!

Just tried it and it seems you get about 15 minutes worth of play time via the rPath Appliance Agent interface which allows you to change the password, create an admin+password for the MediaWiki instance, add an email address (part of the MediaWiki setup process, though it seems any old email (read: fake**) will do) and then access the MediaWiki instance itself.

Nice touch, rPath!

** NOTE: Don’t use any periods in the admin name OR the email address you provide. Using m.david and m.david@fill_in_the_blank threw errors for both, which is why I decided to had little choice but to use a fake email address, as 95% of my email addresses have periods in the handle segment.)

[Original Post]
rPath - rPath Teams with Amazon Web Services

It will work like this: software developers use rBuilder to build an Amazon Machine Image (AMI) that is stored using the Amazon Simple Storage Service (Amazon S3). Then, with a single click, rBuilder and rBuilder Online users can boot their software appliances on Amazon EC2. No more waiting for downloads or fighting with complex installation procedures. Software appliances plus Amazon EC2 deliver software value without the hassles - on-demand. To learn more visit: www.rpath.com/amazon.

So firstly, this *ROCKS*!

Secondly,

Kurt Cagle

AddThis Social Bookmark Button

A couple of weeks ago, the W3C made an announcement that caught a great number of people by surprise. After nearly a decade of inactivity, the HTML working group was being restarted, in order to handle the fairly significant amount of development that has occurred on top of the HTML standard since HTML 4
.3 became the last formal HTML standard prior to the introduction of XHTML.

I have to admit to some qualms about seeing this. I’m not doubting that it isn’t needed - XHTML’s adoption has been comparatively slow because of the legacy base of HTML out there, the introduction of AJAX has shifted the balance of power to imperative scripting, and the realization is increasingly being made that the namespace issues dividing HTML and XHTML are beginning to tear the standards apart.

The question, however, comes back to the role that XML ends up playing in all of this. HTML has its own DOM, can in fact be treated as quasi-XML-like, but it most demonstrably isn’t well formed XML in most cases. For those of us who have been pushing XHTML adoption in industry, this is going to be seen as a fairly major step backwards, as it has the potential to make browser developers decide that perhaps incorporating XHTML support isn’t that big of an issue, and can be pushed off for a release or ten.

M. David Peterson

AddThis Social Bookmark Button

But much to my surprise, MSFT isn’t the one cashing in on the traffic,

CodePlex Information and Discussion

CodePlex gives project owners the choice of placing sponsored ads on their project pages. Project advertising is provided through Kanoodle BrightAds, and all the proceeds from Kanoodle go entirely to the project owner.

Many open source developers work long and hard on their project efforts and use donations or sponsorships as a way of helping to support their efforts. We wanted to give project owners the ability of having sponsored ads for their project if they choose.

Nice! So MSFT: while you seem to be in the giving mood, can you please implement support for Subversion, an option to use Trac, as well as the ability to allow direct deployment of ClickOnce and/or ClickThrough apps from a project repository? That would be just peachy!

Thanks in advance for your considerations. ;-)

Kurt Cagle

AddThis Social Bookmark Button

My father is a student of semantics, and he was the one to first get me interested in it. His own interest came when he went to university, around the time that such seminal figures as S.I. Hayakawa, Noam Chomsky and Marshal McLuhan were challenging the boundaries of what we mean by meaning. Hayakawa, linguist and protege of the great Korzybski, examined the way that language is both shaped by and shapes our thoughts, our interactions, even the underpinnings of what we call civilization. Chomsky is perhaps known today more as an impassioned speaker against the rise of corporatism, but at the time he was doing ground-breaking work exploring the origins of language and the process of learning how to speak, write and thing. McLuhan, on the other hand, challenged our assumptions about the various media of communication, and is perhaps most known for the statement, “The Medium is the message”.

Curiously, none of these three were “engineers” in the sense that we think of the term today, nor were they programmers. Their questions revolved around the issues of how humans deal with communication, with turning musical grunts, trills, coughs and clicks into abstract concepts and ideas that could be both contained within ones own head and passed along to others.

Fifty years have passed since the heyday of this generation of semanticists … McLuhan and Hayakawa both passed away years ago, Chomsky celebrates his eightieth birthday next year (he was born December 7, 1928). The world has changed in ways both subtle and profound since then, but one of the most significant is the fact that semantics has gone from being a branch of philosophy to being a branch of computing. Oh, surely not, you may protest - there are many things far more important than this - computers in general, the evolution of aircraft, the space programs, biotechnology … all must be more important than the shift of an obscure part of philosophy into an obscure part of programming.

Rick Jelliffe

AddThis Social Bookmark Button

Here is a way to express basic Entity Relationship model using ISO Schematron. Schematron allows you to model entities and various relationships using “abstract patterns” (a parameterized macro facility.) You can use the same idea to model other kinds of diagramming and modeling systems.

I think what it quite powerful about this approach is that we can separate the information relationships from the XML serialization. Fields can be child elements, attributes, attributes of the parent, any kind of XPath, we don’t care. Similarly, if two fields are related, they may use a key or containment, but we don’t care. If you like you see this a technique for capturing information from a model in a form which also happens to hook into Schematron validation; but you can use the captured model for non-Schematron purposes too!

(This is an updated version of a post to XML-DEV mail list, in Nov 2006.)

M. David Peterson

AddThis Social Bookmark Button

A couple of months back I got a *TOP SECRET* invite from a *TOP SECRET* laboratory located in a *TOP SECRET* location here in Utah to gain a *SNEAK* pre-view of a *TOP SECRET* browser-based web services development tool. For now, we’ll call this tool “Frank.”

Me: So, “Frank”, tell all the Land’OXML about yourself.

Frank: My names BUNGEE, you phreak. Stop calling me Frank!

Me: *WHOA* Frank! You can’t tell them your real name! You’ve signed an NDA!

Frank: No, *YOU* signed an NDA. I can say anything I want.

Me: Wow. Little snippy today, aren’t we Frank?

Frank: BUNGEE!!! My name is BUNGEE, BUNGEE, BUNGEE!

Me: Well okay then, Frank.

Frank: BUNGEE!

Me: Frank? Didn’t you just say that it was I that was under NDA and not you? Wouldn’t that mean that you can call yourself anything you want, but I can’t?

Frank: That’s a good point. I’ll give you that one.

Me: Thanks! :D I like points. How many did I get?

Frank: Don’t push your luck.

Me: Okay.

Me: So “Frank”, since my tongue is still bound by the legal system, why don’t you tell the good people in Land’OXML about yourself.

Frank: Okay. Well, I’m 6 feet tall, Blonde Hair, Blue Eyes, and

Me: Frank. That’s not what I meant. How about telling them about — you know — who you are, what you do for a living, if you happen to be giving any free seminars in San Francisco tomorrow (the 22nd) and in Orem, Utah on the 29th, and for those who want to get their mind blown with a sneak preview of the next generation of web services development tools to shoot Brad (bradATbungeelabs.com) a *TOP SECRET* email with the code word,

“Frank sent me…”

as the subject.

Frank: Well I would, but *YOU JUST DID ALREADY*

Me: Oh, well… hmmm… That’s a good point. Frank, question: Did I just break my NDA?

Frank: Just give me the mic.

Me: Okay.

Frank: So, a little about myself,

M. David Peterson

AddThis Social Bookmark Button

Find out **.

Rick Jelliffe

AddThis Social Bookmark Button

Schematron is an ISO standard schema language for making assertion about the presence or absense of patterns in XML documents. It has fairly widespread use, from publishing to transport to financial and insurance to health systems, but is not supported by major vendors yet. Schematron is aimed at being a general purpose (rather than domain-specific) rules language for expressing both the kinds of complex structural rules that are beyond the reach of XML Schemas schemas and for expressing simple business rules. Most people use my open source XSLT implementations of Schematron 1.5 (at htp://www.ascc.net/xml/schematron) which is being upgraded to the ISO spec (at http://www.schematron.com), but versions exist from other developers in Python, Perl, C#, and Java.

One of the aims of Schematron was to allow all the constraints in a system to be printed out in bullet list form: literate programming comes to schemas. ISO Schematron allows you to put requirements in free text paragraphs (customer’s view), then to put the natural language assertions that test these in bullet point form (the analyst’s view), then to arrange and mark these assertions up with the appropirate IDs and XPaths (the devloper’s view). This can improve traceability from requirements to analysis to implementation for validators.

But one persistant problem has been that there are often business requirements which are untestable. For example, a business quality requirement that “The document shall be maintainable.” is legitimate but not necessarily the thing that you would use a schema to test. (Actually, now that I think about it, I wonder whether it is possible to put the Document Structure Complexity Metric as an XPath that an assertion tests….hmmm)

And there is another kind of constraint that is not tested but will be testable later: perhaps you haven’t got the XPath skills to create the test, or perhaps it is based on some future event, such as “All dates in this document must be during the US presidency of G.W.Bush.”

So are these kinds of constraints things that can never go into a Schematron schema, or just remain as comment-like paragraphs?

What we can do is have dummy assertions, which never fail and provide a place to park these kind of constraints. Lets make up a pattern for them, and we will use two roles “Untestable” and “Unimplented” to distinguish some of the reasons why the assertion does not have a fallible test.

<sch:pattern>
<sch:title>Untested Assertions</title>

<sch:rule context="/">
   <sch:assert test="true()"  role="Untestable" >The document shall be maintainable</sch:assert>
   <sch:assert test="true()"  role="UnImplemented" >All dates must be during the term of G. W. Bush.</sch:assert>
</sch:rule>
</sch:pattern>

Now the constraints are “part of the system” the same as testable constraints, and their status as untested or untestable (by Schematron) is explicit. There might be other roles too: “RequiresCustomTestApplication” for example.

M. David Peterson

AddThis Social Bookmark Button

I recently wrote an entry to my Dev.AOL blog entitled “Solving FizzBuzz in XSLT 1.0” built upon the premise that in the real world, data changes, and it’s because it changes that instead of thinking of how to solve problems using static data variables, we should instead be trying to solve them in ways that are much more adaptable, and therefore, reusable.

In other words, if your desire is to find someone who truly understands how to write code that solves real-world problems, then use real-world scenarios: Data changes > You’re code shouldn’t have to change with it.

Of course one could argue “code? data? what’s the difference?”, and of course, I would only be able to agree. But none-the-less, there’s still a need to write the initial processing code that will then process and transform the data code into whatever it is it needs to be transformed into, and it’s on this premise I present the following as a picture perfect example of what truly Beautiful Code looks like.

After writing the initial “in XSLT 1.0″ post linked above, I made a post to XSL-List with the following request,

AddThis Social Bookmark Button

First, my apologies for writing a personal introduction, then being silent for the next three weeks. That’s not what I had been planning. The problem, as you might guess, has been too much work, too many jobs, all competing for the same time. And, since I’m an entreprenuer, my work tasks feel no shame about demanding my time late into evenings and on weekends. They tell me I chose this career path, so now I must bear it!

Meanwhile, during those weeks I have thought about posting here again. So, having a bit of unexpected “free” time, and not being the kind of person who finds the “couch potato” kind of relaxation particularly appealing, I just sat down and started writing this. But not until some reading of other blogs stimulated these thoughts.

Technology, Blogs, and Traditional Journalism

Though I have several web sites of my own, the AOL Developer Community is pretty much my home site, these days. So, I was visiting the consolidated blog page there, catching up posts I hadn’t yet read.

As I browsed back into the posts, I came upon “The Power of an Apple”, posted by M. David Peterson (of XML.com fame, of course). I’d noticed the post several days ago, but I hadn’t read it, being preoccupied with project deadlines. I was expecting the post to be about Apple computers, or about some technical innovation wherein an apple is employed to illustrate some technology principle. To my surprise, the apples in this post are simply eaten, and the post describes the effect on the writer of such ingestive activity, along with the negative consequences a deficit of apples seems to induce.

Now, this post is certainly unusual for a technology blog. Some people might even argue that it doesn’t belong on a technology site. But as I read the post, my long-running interior discussion with myself about the relationship and differences between traditional print journalism and blogs was reignited once again. I find blogs and blogging fascinating in many ways. While “serious” blogging is similar to old-fashioned newspaper reporting, it’s also different in important ways. I think it has to do in part with the cost of posting information on a blog, versus the cost of printing on paper. But, my point is that in a blog — not an individual post, but a writer’s blog in its entirety — there is room for a fuller view of the person behind the posts.

Of course, many blogs are exclusively expressions of an individual’s personality (or the image they’d like to convey). I’m really talking here more about “serious” blogs, for example, technology blogs. In the past, in printed newspapers, or magazines, the editorial staff could not permit publication of a piece that would be of interest to only a small number of readers. With the Internet, with blogs, with Technorati and Google and other blog aggregators, the cost of publishing is lower and yet it is still possible for people to find the post.

Of Apples, Alarm Clocks, and Mother Nature

Anyway, finding myself unexpectedly at home today, while I was on hold calling USAirways to try to get a refund for our tickets for our flight (which was cancelled by New England’s very late “Noreaster” snow/sleet storm), I found it very interesting to read:

In the same way the sun rises and brightens each of our days gradually, I believe that our bodies have been trained for millions of years worth of evolution by this same process to gradually come back into the world of wakened conscience.

and:

The smiles back! And thanks must be given, yet again, to Mother Nature, a woman who obviously understands how to take care of herself: No alarm clocks, no additives, and no preservatives.

Reading one of my peers saying such things was enough to give me pause. Hmm.. So, yeah: even though so many of us spend enormous amonts of time thinking about and working on understanding new technologies and their implications; even as we find ourselves facing an almost overwhelming blizzard of information that we know is interesting and we know would be useful to study and understand; even as we see hundreds or thousands of potential good reads fly by each day, their titles barely skimmed, because we know we don’t have time for the pursuit… it is important to somehow try to keep things in perspective.

Seeing this added something to my interior conversation about blogs versus traditional publishing.

The Advantages of Blogging

I’m no psychologist (in fact, I tend to doubt that their formulations are of much relevance in most cases), but I think blogs can play an important role for both writer and reader that was not possible in the case of old-fashioned print journalism. You’re reading along, it’s part of your job to study this stuff, and you suddenly come across a little island oasis, such as “The Power of an Apple.” And it’s refreshing to the reader. Hopefully it was refreshing for the writer as well.

Although some people may complain (”What does this post have to do with this site?”), the Web is such that a mouse click takes you elsewhere if you’d prefer to read something else. But I’d actually argue that a post that brings some revelation of how the author lives, where the energy that is seen and expressed in the technology posts comes from, how life is ordered to support and elicit the creativity that is displayed in the more standard technology-specific posts, is relevant. It’s useful (and also entertaining, to me) to know that three apples a day and a lack of alarm clocks are what is behind posts such as

and a very interesting project project involving

Quite a combination of technologies! And it works on all the major platforms: Windows, Linux, Mac, etc.

Conclusion

Blogging as a venue for technology writing can bring the technology itself more to life, I think, because in knowing something about the writer, who is the “narrator” of the technology posts, you have additional insight into the point of view from which the technology is being surveyed, described, analyzed. I consider that to be valuable insight, which could not normally be published in traditional printed media.

For example, now that I know M. David Peterson doesn’t use alarm clocks and does eat apples, I feel like I understand his other posts and his project just a little bit better. Not that I can throw my alarm clock out the window (though I’m working diligently to try to get to that day). But, we do have some nice Fuji apples waiting out there in the crisper. Hmm…

M. David Peterson

AddThis Social Bookmark Button

Update: So I’ve now got things set-up such that there are several default/built-in collections, and for now have updated the client-side XSLT to access the APP service document @ /service/pub/, access each of the related Atom feed-based collections, and create a simple report that outputs the detailed info for each collection, and if there are any entries (which by default, there is not), will iterate over each entry and output a report of each.

A couple of screen shots to help warm each of your AtomPub-enabled hearts,

Xameleon.Amplee.ClickOnce.F.png

Xameleon.Amplee.OSX-update.png

I also spent some time and quickly added an AtomProvider class to the DynamicWebServiceHelpers project for IronPython provided by Microsoft such that you could load collection feeds served by by Amplee from the IP console app. Take a look at the rss.py sample provided with this same sample project to gain a feel for how to use it.

Oh, I’ve checked the updated code into SVN, and updated the ClickOnce app as well. For more detail (URI’s, etc.), please see the end of this post.

Tomorrow I will be adding in the ability to add/update/delete entries in each collection to then take these entries and mash them up with with any external web feed, outputting the result in a reusable ModuleT to be rendered on any system which supports ModuleT (At present time AIM Pages as well as the built in capabilities I will be adding to the Xameleon code base to render them locally.) If you take a look at the flickr.py or the amazon.py from the same DWSH project, it should become pretty obvious how the combination of IronPython, Amplee, AtomicXML, ModuleT, and LiveClipboard will enable some pretty amazing mashup capabilities that bring together REST, WS-*, APP, and/or any given Atom/RSS web feed, combining them together in any way you can imagine, and making them reusable and shareable with nothing more than a simple copy/paste of the LiveClipboard scissor icon.

Fun times ahead, but for now, however: Sleep ;-)

Bye… :D

M. David Peterson

AddThis Social Bookmark Button

Bill de hÓra: Future proofing

Gregor J. Rothfuss
(March 16, 2007 11:18 AM #)

http://greg.abstrakt.ch/gallery2/d/24336-1/googleflat.jpg

After clicking the above link left as a comment to Bill de hÓra’s recent “Future proofing” post, it took me a second, but then realized why this most definitely deserved credit as Photo of the Day.

Google: Deliver this, and your Goose is forever Golden; MSFT: Deliver this and you’ll cook Google’s Golden Goose for this years Thanksgiving Dinner.

’nuff said. ;-)

Rick Jelliffe

AddThis Social Bookmark Button

I had a nice email from a person involved at the highest level with ODF yesterday, saying he didn’t think I was being extreme in my recent blogs about contradiction at ISO. Very encouraging and gentlemanly. He also said Well, the gratuitous comments about people whipping up passions may have been a bit much but … I was sure you are as tired of the hype as I am. Quite so.

Kurt Cagle

AddThis Social Bookmark Button

Call it Len’s Proposition … I’ve been taken to task recently by Len Bullard for my unflagging support and belief in open standards in general and XML in particular. I respect the many voices here on XML.com, especially Len’s, so when he starts saying the sky is falling I generally at least look up, but it occurs to me that this offers an opportunity for many of those same commentators to express their positions about the big issues about XML. With AJAX and JSON in one corner, .NET and Linq off in another, Java sitting impatiently in a third, not to mention a host of languages such as Scheme, Lisp, Haskell, etc., just waiting to get their boxing gloves on, XML’s position may be far from secured. So I think the ultimate question I’d ask here is simple:

Is XML doomed? Is it fatally flawed, too weak, not weak enough, too abstract, too specific? Is the core philosophy that it enables, the principle of open standards, a far-left communist plot or the salvation of the computing world as we know it? Have we gone down the wrong path, and only determined action now will right that wrong?

I’ll weigh in with my own opinions in a bit, but I open the floor to one and all … was XML a mistake? If so, why, if not, why not? (You may use a #2 pencil for your answer).

M. David Peterson

AddThis Social Bookmark Button

Update: Note to self and others: When attempting to setup a recursive planet using Venus (e.g. http://planet.xsltransformations.com/river includes http://planet.xsltransformations.com/ and http://planet.xsltransformations.com/flood includes http://planet.xsltransformations.com/river) don’t use the same on-disk location for the feed cache.

The result if you do? Hours of wondering how on earth entries with the author listed as “Planet XSLTransformations + River” are making it into the root of the planet.

Doh!

[Original Post]
Holy Hannah!!! I guess I should have realized that adding both Technorati[xslt|xsl|xsl-fo] and del.icio.us[xsl|xslt|xsl-fo] to the mix would have resulted in the flood of XSLT-related material that it did, but as with all things in life, experience is what helps make you a wiser human being.

So,

http://planet.xsltransformations.com/ = A “stream” of XSLT-related material from a pre-determined list of XSLT hackers, technologists, and overall community members.

http://planet.xsltransformations.com/river/ = All of the above + del.icio.us[xsl|xslt|xsl-fo]

http://planet.xsltransformations.com/flood/ = All of the above + Technorati[xsl|xslt|xsl-fo] + del.icio.us[xquery|xpath|linq]

I’ll be tinkering with the last two until I get it to what seems like the right level of river and flood status. If you would like to help in my attempt to control the flood gates, by all means, please do.

Please note: If your feed reader looked anything like mine after the initial flood, my most sincere apologies! Hopefully things will be a bit more under control now.

M. David Peterson

AddThis Social Bookmark Button

Update: Miguel has provided a *FANTASTIC* follow-up to a question posed by Robert,

Well let’s hope Miguel changes from Mono to Java now that Java is going GPL.

I personally questioned why on earth he would want to do that when Mono provides support for not only the .NET languages, but for Java itself via IKVM.NET, but Miguel has taken it several steps further by not only providing an extended understanding of why he feels Mono/.NET is the better platform, but most importantly (from a Linux-Geek perspective, which Miguel quite obviously is), why it’s the better choice in regards to the promotion of Linux as a platform. He then concludes with,

trollbait: I think that if anything, now that we got a free java in the pipeline, the free java community can focus on improving Mono :-)

Beyond stating that I most definitely agree with Miguel, I’ll let the rest of y’all take it from here, providing the following question to build and extend from,

What’s the point of having a platform in the first place, if you don’t utilize that platform to its fullest potential? For example, Windows has its strengths, and its weaknesses. The same is true about Mac OSX. The same is true about Linux. Why approach things from the standpoint of vanilla, when there’s chocolate, strawberry, vanilla almond fudge, and *OH SO MUCH MORE* than just plain-old vanilla?

That’s not to suggest that vanilla is a bad flavor in the same sense that chocolate, strawberry, or vanilla almond fudge are bad flavors. It’s really up to you which you prefer, right?

So then: What’s your favorite flavor? and why?

Thanks for the follow-up, Miguel!

Update: Manu Stapf followed-up this same post recently with the following,

If you read the original article till the end (http://www.apostate.com/
programming/bm-freesoftware.html) you will realize that Bertrand Meyer
was actually proposing a new way (thus the title of the article) to
embrace/recognize/encourage/promote a vision of open source software.

I’ve appended to the end of this post Bertrand Meyer’s “A COURSE OF ACTION”. I am still trying to consume it all internally, but I have to admit that at first glance it most definitely seems to hold significant value as source of inspiration in regards to how to approach the world of software, both open and closed, commercial and non-commercial alike moving forward.

Thoughts from the community at large?

[Original Post]
Eiffel now Open Source on 13 Mar 2007 - tirania.org blog comments. | Google Groups

Not everyone understood open source the day it was launched.

Look at Sun and Java, they have a history of decades of resisting the
open sourcing of Java, it took a long time, but they eventually
changed their mind.

Like someone said “wise people change their minds” ;-)

Miguel.

M. David Peterson

AddThis Social Bookmark Button

http://planet.xsltransformations.com/ [feed, opml]

I’m still busy adding feeds from folks in whom most definitely provide value to those with interest in XSLT and/or similar technologies such as XQuery and LINQ. I was going to hold off from announcing this until I was absolutely certain I had everybody on there that obviously should be on there. But since that will more than likely never happen within a reasonable time frame, and since the best way to fix something is to give it to people to play with such that they can tell you where it’s broken (and/or how they broke it, though in this case (I hope!) this really doesn’t apply), I figured now is the time to announce its existence, using a collaborative approach to filling in the missing links. If it seems to you there is someone missing, please let me know. Thanks!

Update: Please note: If you visit the site expecting to see your name+link, and for some odd reason it isn’t there, please don’t take it personally. I do stupid things like this *ALL* the time. For example, it took me close to a year before I realized I didn’t have Elliotte Rusty Harold listed on the XSLT:Blog “Legends of the XSLT Community” roster. With this in mind, please forgive me, and let me know of my evil sins so I can properly repent. Thanks!

In addition to adding new folks as time continues, I also plan to begin the integration of the client-side XSLT framework I have been developing, some of which you can find @ http://extf.googlecode.com/svn/trunk/WebApp/public_web/ (and play with @ http://browserbasedxml.com/ ). This same framework you will be learning about in the now quite belated Open Source XML Weekly Roundup (which will now be titled “Week 9 1/2″ when it’s ready to go either later today or tomorrow ;-)

More on that when it arrives ( < obviously! ;-)

Oh, and also,

M. David Peterson

AddThis Social Bookmark Button

Microsoft guns Open XML onto ISO fast track

March 12, 2007 (Computerworld) — The International Standards Organization (ISO) agreed Saturday to put Open XML, the document format created and championed by Microsoft Corp., on a fast-track approval process that could see Open XML ratified as an international standard by August.

That’s despite lingering opposition to Open XML by several key voting countries, including some of whom whose governments are moving forward to adopt the alternative Open Document Format for Office Applications (ODF) format, which the ISO approved as a standard last year.

Hmmm… Interesting… A bit further down we discover,

Rajchel wrote that she decided to move Open XML forward after consulting with staff at the International Technology Task Force. She did not mention that the 6,000-page proposal, submitted by another standards body, Ecma International, had garnered comments and criticism from 20 out of the 30 countries sitting on the JTC-1 committee.

Hmmm… Interesting… A bit further down we discover,

Rick Jelliffe

AddThis Social Bookmark Button

Oh, I guess that would make me me pretty much right then…

The excellent Erc Lai at ComputerWorld is reporting that JTC1 has accepted Ecma’s responses to the recent contradiction review, and that matters are proceding to the next phase of the voting as normal. It is worthwhile reading these responses, for people interested in openness or getting the other side to the story. (The rest of the post is based on the assumption that his report is correct.)

I think I can speak a little more frankly now. I didn’t want to inflame the situation before. I think certain stakeholders quite cynically or gleefully inflamed passions and expectations unnecessarily by utterly inflating the importance and urgency of the contradiction phase. I mentioned in my blog my opinion that contradiction will mean something quite limited and specific, that consistency with previous practice is a guide to what to expect, that a 30 day review really is only intended to catch the broad problems that could be found by a quick skim not detailed technical issues, that detailed technical issues get dealt with by the six month review, that JTC1 would not take on economic or political arguments, and that the whole process is geared towards resolving issues (win-win) not win-lose. The big vote is the FDIS national ballot final national vote at the Ballot Resolution Meeting, not the 1 month period.

What we don’t see from these stakeholders is “Oh I guess we were quite wrong on what a contradiction is; oops.” Instead, we are starting to see a shoot-the-piano player reaction; will we now see attacks on the individual officers, on the secretariat, on the ISO procedures, blame on everything except the bad advice? I see somone lamenting “Have we no standard for standards?” but that is what the JTC Directives are. I think they will now have to whip up emotions that the process is flawed, so that their true believers won’t start thinking “oh, perhaps there are two sides to this story” or even “oh, perhaps being anti-MS and pro-ODF does not require me to be anti-ISO OpenXML”.

The thing is that the contradiction review is actually a small formality to prevent wasting time and to escalate certain problems to JTC1, not THE ONLY CHANCE TO STOP THE MONSTER! The focus of the process is first to get issues raised, ascertain that Ecma has responses that indicate the matter can move forward using the normal process, and then to get to the more detailed technical review stage.

So the Ecma spec is now at the “Draft International Standard” stage (DIS).

In five months time, as I understand it, there will be a ballot of the national bodies that are P-members of SC34 (participating members, rather than observers). In ISO procedure, there are votes for “abstain”, “yes”, “no” and “no with comments”. I expect that OpenXML will have a lot of “No with comments”, which happens sometimes. These comments give all the technical (and perhaps IPR) issues that have been found that are showstoppers, together with other misc comments.

After this ballot, we all wait two or three months. This gives everyone time to draw their breaths, gird their loins, examine each other’s positions, and prepare responses. Then there a ballot resolution meeting, at which all the issues are dealt with; the meeting may respond with a fix to the DIS, or with a comment that this is an issue for further study and enhancement, or they may to decline to fix the problem, or they may say that it is not a problem in their opinion. Then Ecma the editor takes these on board, makes the appropriate changes, and this becomes the FDIS, the “Final Draft International Standard”.

[Update: 2007-05-17. A few weeks before I wrote the following, the JTC1 Directives were altered, which I (and others) had not become aware of. The procedure has changed slightly, so that the final vote occurs at the end of the ballot resolution stage, *before* the editor creates the final draft which ultimately is reviewed by ISO ITTF. The final vote is still on national lines, but it occurs at the meeting, not subsequently. So while the time until the standard is published is still pretty much unchanged (12 months-ish), the time for review looks like being 9-months-ish.]

The FDIS then gets sent around to the national bodies, and a vote is taken after 30 days. So we are talking, 1 month admin review (contradiction) + 1 month (Ecma response) + 5 months (detail review) + 2month (collection) + 2 months (resolution) + 1 month (pre-vote). A Fast Track standard with any controversy actually has about 13 months of review time before the final vote minimum! Now contrast this with the panic from the “30 days is too short” crowd. I cannot believe the instigators didn’t know better: even a fast-track procedure is lengthy, open and serious.

So does that mean that NZ, Canada, Singapore, Kenya, and so on were wrong to raise their issues? Not at all. It is good to have all the issues on the table, and opinions can legitimately differ, especially in the absense of explicit guidelines or precedent on what contradictions are. I expect that the JTC1 secretariat and the ITTF would have a very limited class of issues that would come into their perview, and that otherwise they would punt to the normal mechanism. Indeed, as was the case with Australia, it is good that they pass on the various comments earlier rather than later, so that they can be responded to.

What is important to realize is that even though JTC1 may have found that Ecma’s responses to the contradiction issues are satisfactory enough to let matters proceed, it does not mean that therefore the issues themselves go away. It is not JTC1s job to make technical decisions but procedural ones, is one way to put it. I expect that some of the issues will be re-put at the DIS ballot, some will have been answered by the Ecma response already, and there will probably be some more. The DIS ballot resolution process looks like being a long and difficult job, but it will make the claims that OpenXML had no input from an open process fairly untenable; probably ODF should have had the same amount of scrutiny.

Michael Day

AddThis Social Bookmark Button

Okay, I’ve heard jokes about people parsing XML files backwards, starting at the end of the file and reporting SAX events in reverse document order, but it seems that someone has actually gone and done it. The justification sounds almost plausible: an instant messaging client (Adium on the Mac) that writes out XML message log files and uses backwards parsing as a method for retrieving the last N messages in constant time, regardless of how many messages the file contains in total. However, it’s crazy to think of doing this for XML in general.

Rick Jelliffe

AddThis Social Bookmark Button

The former State Government CIO of Massachusetts (a state in USA) Louis Gutierrez has a worthwhile interview in Computerworld this week. What I like about the article in particular is that he seems clear that the role of govenment is not just to passively accept standards, but to pragmatically assess the suitability and impact and processes of each one. ISO makes “voluntary standards” not laws.

Gutierrez says the obvious: that there is a marked difference in the feature set between ODF and OpenXML (”straightforward simplicity” versus “feature-rich but very idiosyncratic diversty”): he does not trivialize the difference in coverage or scope between the two. More interestingly, though he is obviously not a pluralist at heart he does acknowledge that there may be legitimate benefits in plurality (especially if we take the strategic view where we are trying to set up the pre-conditions for full-function office standards, to benefit users rather than hungry vendors.)

…The trouble with people who say “You are ignoring the elephants in the room” (i.e. that the utterly dominant thing is the corporate and political machinations) is that the room actually contains more than elephants…at the end of the day, when the elephants have left, some poor schmuck has to clean up. The hard technical issues cannot be ignored: which applications work?, can the format carry the information we need? for each area, what is the allowed trade-off between fidelity and interoperability? What validation is being performed? Which version of the standard is being required? Which profile? Which test suite? What encouragement is being given to Open Source developers (e.g. KDE)?

M. David Peterson

AddThis Social Bookmark Button

In regards to Daylight Saving Time here in the United States, this year: It’s both! 3 weeks back, and 1 week forward to be more precise, accounting for 4 more weeks worth of sunshine to brighten each and every one of our days.

Unless you live in Seattle**. ;-)

via the U.S. Naval Observatory Astronomical Applications Department,

Michael Day

AddThis Social Bookmark Button

Following on from Kurt’s detailed reevaluation of XSLT 2.0, I thought that I might share an example of what you can do in XSLT 1.0 with the assistance of EXSLT, a useful set of extension functions that are supported by most XSLT implementations.

Kurt Cagle

AddThis Social Bookmark Button

I recently wrote a blog about the directions that I saw with XML, and while it has proved to be fairly popular, it has also generated a fair number of comments that really need their own more detailed examination. One of these, and one that I’ve been planning to write for a while anyway, has to do with my comments about XSLT 2.0 increasingly being used as a “router” language, replacing such applications as Microsoft’s BizTalk Server.

This is not a disparagement of BizTalk - it’s actually one of the Microsoft technologies that I have actually endorsed on a regular basis, because it solves one of the thornier issues involved in creating complex data systems - how do you handle the intermediation of data coming from different data sources, and while I have some quibbles about the interface, I think BizTalk does its job admirably. It also served as a bridge technology for quite some time between the SQL and XML worlds, and it will continue to serve in that role for quite some time to come.

However, I also think that in the long run it is a bridge technology, and that as the world moves increasingly to the use of XML as the preferred data transport story, other technologies, such as XSLT 2.0, will likely end up making most of it functionality redundant and useful at best on the edge cases. Such statements presuppose a lot, of course, most notably that XML is in fact becoming the dominant form for expression of data content. Before digging into XSLT2, I’d like to address this issue head-on, as it ties into a number of other things I’ve been looking at of late.

M. David Peterson

AddThis Social Bookmark Button

… and went to Book Publisher Sales Heaven!

Teens buying books at fastest rate in decades

Like a lot of teens, Leslie Cornaby has a crowded schedule — her days crammed with homework, hobbies and an array of techno diversions. When she’s not checking e-mail, she’s cruising YouTube or scrolling her iPod to tunes by Pink or Christina Aguilera.

She’s also reading — just for the glorious fun of it — and says, “Most of my friends are readers, too.”

The Shorecrest High School sophomore may not realize it, but she’s enjoying the fruits of one of the most fertile periods in the history of young adult literature.

It’s a time of strong writing and strong sales as readers in the 12-to-18 age group rock the marketplace.

“Kids are buying books in quantities we’ve never seen before,” said Booklist magazine critic Michael Cart, a leading authority on young adult literature. “And publishers are courting young adults in ways we haven’t seen since the 1940s.”

Wow!

Not only are teen book sales booming — up by a quarter between 1999 and 2005, by one industry analysis — but the quality is soaring as well. Older teens in particular are enjoying a surge of sophisticated fare as young adult literature becomes a global phenomenon.

Double *WOW*!

M. David Peterson

AddThis Social Bookmark Button

Update: via Rick Jelliffe,

I think this is really promising. The WhatWG material will undoubtedly be the prime inputs for consideration. It would be nice if the WG had its minutes published in the open, given the public interest.

As I pointed out in a follow-up, while the minutes are obviously different, the archives for the mailing list are labeled “public-html”. 12 posts and counting, and yep: They’re public > http://lists.w3.org/Archives/Public/public-html/2007JanMar/

Thanks for your input, Rick! I most certainly hope your thoughts regarding the work of the WHATWG are exactly how things play out.

[Original Post]
I don’t always have the kindest things to say both to and about the W3C, often criticizing them for their closed door, closed ear policy when it comes to listening to what the developers in whom use their technologies have to say on any given matter.

This time around is different,

W3C Relaunches HTML Activity

http://www.w3.org/ — 7 March 2007 — Recognizing the importance of an open forum for the development of the predominant Web content technology, W3C today invites browser vendors, application developers, and content designers to help design the next version of HTML by participating in the new W3C HTML Working Group. Based on significant input from the design and developer communities within and outside the W3C Membership, W3C has chartered the group to conduct its work in public and to solicit broad participation from W3C Members and non-Members alike.

“HTML started simply, with structured markup, no licensing requirements, and the ability to link to anything. More than anything, this simplicity and openness has led to its tremendous and continued success,” explained Tim Berners-Lee, W3C director and inventor of HTML. “It’s time to revisit the standard and see what we can do to meet the current community needs, and to do so effectively with commitments from browser manufacturers in a visible and open way.”

Good on ya W3C!

Michael Day

AddThis Social Bookmark Button

Today we released a new alpha version of Prince that supports a new CSS property that we cooked up for performing text replacement. It is loosely based on the tr command in UNIX and Perl and was inspired by a specific use case of wanting to replace straight apostrophes (’) with curly right single quotes (U+2019), which are more aesthetically pleasing in some fonts and harmonise better with the use of curly double quotes. With the new property, this can be achieved like this:

Rick Jelliffe

AddThis Social Bookmark Button

Kitchen-sink standards are developed by committees and have to cope with a wide variety of different applications. If someone’s software does something, there has to be some element or attribute or value stuck in. Sometimes the backdoor of properties (open ended value lists) is used, so that the schema can be simplified at the expense of enumerating possible values. But schemas like DOCBOOK, TEI, ODF and OpenXML are classic kitchen sinks.

There is an objective way to detect them: check their Structured Document Complexity Metric and if it is over 300, you probably have a kitchen sink. I gave some metrics earlier in Comparing Office Document Formats.

Now the trouble with kitchen-sink schemas is that any particular set of documents will only use a subset of the total possible features. So writing a complete converter that accepts any possible input from a kitchen-sink schema and outputing them to some more targetted document type is a completely wasteful process. YAGNI. But, and here’s the rub, every so often, someone will in fact use one of these strange often, someone will in fact use one of the elements you didn’t expect.

One way to cope with this is the usage schema. This is a schema derived from sampling representative documents. When new documents come in, you first validate them against the usage schema, and if there is a problem, escalate it to the roject management to schema, and if there is a problem, escalate it to the roject management to discuss how to handle it. It is a sign that the data is not what they expected.

There are some tools to generate XSD usage schemas, but you can also generate them using Schematron. The tool I use first generates all three-level Xpaths found in the document, then makes a Schematron schema that reports if any node was found that was not caught by these XPaths. Very straightforward, but effective.

Another use for usage schemas is for software development. If the customer has provided a sample of the output format, then make a usage schema for that and check that the output from your converter validates. Escalate any differences to project management, This gives a way of proving that your program meets their specs, and also of showing where their specs (e.g. the sample output) was inadequate.

Michael Day

AddThis Social Bookmark Button

Hello, my name is Michael Day and I’m here to blog about XML, CSS, web standards, declarative programming, UNICODE and other topics of interest to XML.com readers. Since a lengthy biography of me is not one of these topics, I shall limit myself to one sentence: I am the founder of YesLogic and the designer of Prince, an XML + CSS formatter and a great way of getting web content onto paper.

Now that we’ve got that out of the way I would like to get straight into talking about XML parsing and UNICODE encodings. In Prince we use libxml2 for all of our XML and HTML parsing needs, and have been very happy with it. However, it’s always interesting to see new approaches for XML parsing that may offer greater speed or convenience than existing methods.

M. David Peterson

AddThis Social Bookmark Button

You are my new hero!

Don Dodge on The Next Big Thing: Microsoft lawyer rips Google on copyrights - Why?

Oh boy, here we go. Microsoft attacks Google on copyright regarding their book scanning project, and then takes a swipe at YouTube as well. Really dumb move! What are these Microsoft lawyers thinking? Even if they are right, which is debatable, what reaction do they expect from the public at large? This strikes me as pandering to the Association of American Publishers where the Microsoft lawyer is speaking today. Here is a transcript of his speech. The speech is actually pretty good until he drives over the cliff and starts slamming Google.

The AAP filed suit against Google for copyright infringement 16 months ago, and it is still in the courts. What is to be gained by making these inflammatory comments? Be quiet and let the courts sort this out.

NOTE: For those unaware, Don Dodge is a lawyer. Apparently I was mistaken. As I followed up in a comment below.

All of this time, and I thought you were a lawyer? Yikes! I think it must have to do with your “say it like it is” approach to the business world (which I *GREATLY* admire, btw…), and your tendency to tackle the tough, legal related issues that most ’softies tend to steer clear from.

ADDITIONAL-NOTE: For those unaware, Don Dodge works — as an *lawyer* inspiration to us all — for *Microsoft*!

Dear Microsoft,

Rick Jelliffe

AddThis Social Bookmark Button

Geekfodder!

Rob Cameron, who is a professor at Simon Fraser University, has released u8u16 in open source beta, a really exciting library which implements an “iconv” like transcoder (i.e. it converts data from one character set and encoding to another), and which uses the SIMD instructions that modern CPUs have.

I think I was the first person to write something on this technique, certainly on the Internet, in my blog item Using C++ Intrinsic Functions for Pipelined Text Processing a couple of years ago, but only because the idea was too obvious to people involved with DSP to write about, I gather: of course you can use instrinsic functions for text processing! My code just used C++ intrinsics as an optimization on top of C++ code. But Cameron takes it to another level: his code abstracts out the features of the most common SIMD devices so that his algorithms can be arranged to work on this abstraction and compile to a wide range of targets processors, and he can dispense with the code. He reports 4 to 25 times speed increases, depending on the data; which is very promising.

I would love to see an XML parser that combines Cameron’ SIMD work with the optimizations from IBM’s XML Screamer, which seem to increase the speed of Java processing by two or three fold. Cameron’s work is important because it gives a working abstraction that can inform decision-making on buiding SIMD-using capabilities into Java’s text processing.

M. David Peterson

AddThis Social Bookmark Button

Holy Hannah, is it Friday already!? Actually, I think technically it’s Saturday where I’m located @ in the shadow of the Wasatch Mountains, but the O’Reilly servers are in the Pacific Time Zone, so when I hit post, you won’t realize that I actually missed my self imposed Friday night deadline > Of course, now you do.

I need to learn how to keep my mouth shut.

So here’s the deal: I am holding out what I had planned for today (which is a *BIG PHAT* demo of the wonderful new ccHost 4.0 release) because it exposes one of the underlying sub-projects of the the Viberavetions Project, and I haven’t had a chance to discuss with the BizDev folks whether or not they would have a heart attack if I were to somewhat launch one these aforementioned sub-projects without their blessing.

My guess is “YES!”.

Therefore we wait… ;-)

That said: Holy Hot Flash! < What's Google got cooking over there in the land of Google Code?

Peter Fisk speculates,

Kurt Cagle

AddThis Social Bookmark Button

I’m sitting on the Queen of Sannich, one of the older BC ferries, on its way to Vancouver from Victoria. A gentle fog envelopes the surrounding water, turning the small islands that we pass into abstract hills fading into the distance and the water itself into pale white sheets broken only slightly by the turbulence of our passing. Earlier, we had a spitting of snow, a last lingering reminder that winter gives up its presence only very reluctantly, even into these first days of March.

I usually enjoy these trips on the ferry, for all that they add considerably to the complexity of returning to the mainland. They are times for reflection, for helping to put things into perspective. Reflection, and the time for it, is becoming a rare commodity in this world. We move at “Internet Speeds”, twenty-four hours a day, caught up in the big now, yet in all of our trumpeting of technological advance we often lose sight of the fact that the truly profound discoveries and realizations of humanity did not come from the middle manager, from the hyperactive programmer or the driven politician. Instead, they came from people who sacrificed some or all of their worldly drive for goods and status in favor of spending time in reflection, for taking the time to truly think.

Rick Jelliffe

AddThis Social Bookmark Button

(NOTE: Since this blog, Computerworld published Ecma’s fulll Contradiction Response which includes summaries of the national bodies’ substantive comments. Here is an updated table with positions as apparent from the Ecma responses. I count 7 rather than 8 actual claims of contradictions, but many of the national bodies recommend shifting OOXML into some SC34-based review.)

Here’s the latest on the rumoured positions taken by the national standards bodies that are full participating members of ISO/IEC JTC1 (P Countries). We’ll know more over the next few weeks as material comes online. I’ve summarized things in a following table as best as I can make them out, but (apart from Australia’s comments which I have seen) I’m not too confident in my source, another website.

The responses have two aspects. First there are responses connected to the amount of time available to check for contradictions. Now this is really an ISO procedural matter, and, as I have mentioned before, in effect national bodies get much less than the 30 days to check for contradictions: really it is as little as a week. But it doesn’t matter, because the national vote comes up anyway: as I’ve said before, the contradiction period is a coarse sieve for big issues. At least seven of the thirty national bodies from P Countries have made remarks concerning the time period. I expect JTC1’s answer will be: if this is important, raise this at JTC1 committee.

So second are responses on contradiction proper. It seems that eight (update: 7) of the thirty P Countries have raised issues on contradiction. Another four have passed on issues without necessarily claiming contradiction (in some cases because their procedural comment is that they are not clear on what a contradiction entails.) This is a big number, but given the controversy it is not surprising, and getting the important issues discussed sooner rather than later is in everyone’s interest.

While some of the technical claims are silly (such as the bitmask rubbish) and can be resolved fast, there is an interesting procedural problem: traditionally, when there is some market need for different technologies or approaches to address the same goal, they just get made different parts of the same standard, which lets ISO pretend there is only one standard but actually to allow internal competition under the same number. But that approach is hardly possible for fast-tracked standards, since they come in from different organizations.

Apparantly Ecma has prepared responses, which will be sent to JTC1. Clearly they need to make the case better why OOXML is different from ODF and why there is a market requirement for it (and perhaps why ODF will not be mature fast enough to be usable: fast-track is supposed to be used when there is some aspect of timing where the market requires something fast.) Assuming OOXML survives this round, Ecma only will need to convince one or two countries not to vote “no” and try to get enough of the others to vote “yes”. (8/30 negative is the magic number for preventing a standard IIRC.) I suspect a name change for the proposed standard and some better word-smithing of the scope paragraphs would go a long way to resolve matters.

M. David Peterson

AddThis Social Bookmark Button

Where is XML Going? - O’Reilly XML Blog

“I do not think that JSON is going to “replace” XML; what I do see though is perhaps the dawning realization that the XML Infoset does not in fact have to be represented in angle-bracket notation”. I very strongly agree with that. ‘XML’ will come to mean the Infoset (or the XQuery data model, or tree views and XPath-like axes over object graphs) more than the bits on the wire format. That liberates XML tools to support JSON, various binary XML formats, HTML tag soup, etc. without insisting that everyone play by the XML syntax rules.


Michael Champion
| February 28, 2007 07:59 PM

So I’ll just come out and say it: XML doesn’t matter!

Okay, yes it does. But not in the same way people seem to thing it does. In this regard, I agree 100% with what both Kurt and Mike (Mike’s comment stems from Kurt’s recent above-linked post) have to say on the matter.

In a follow-up comment a while back, I posed the following question,

Advertisement