François Schiettecatte’s Blog

Chromulated Thoughts

Posted in General, Search, User Interface Design by François Schiettecatte on November 29, 2009

So a friend and I have been batting a few emails back and forth about ChromeOS after I downloaded a disc image and played around with it for about 30 minutes. I also listened to some of Paul Thurrott’s thoughts on it on Windows Weekly and went back to play with it a little more. Generally I found it to be pretty good if a little slow, but this was running on a virtual machine and it is a year away from release.

So here are some chromulated thoughts about why this is a big deal:

  1. This will open up new competition around the OS at the netbook end of the market. The first netbooks shipped with various versions of linux but Microsoft sensed that this was an issue and made sure that Windows XP and then Windows 7 became the dominant OS on those machines by bringing down licensing costs for netbook vendors. ChromeOS will re-open that conversation because it is free and it has Google’s backing. One possible outcome is that Windows will become free on netbooks.
  2. This is one of those rare moments when Microsoft is up against a competitor (Google) that is as motivated and has deep pockets as it does. Motivation and money will enable Google to keep developing and pushing ChromeOS past the point where other might give up. Microsoft is not a stranger to this strategy having been well served by it in the past where products did not hit their stride until the third or fourth version.
  3. Google is being very careful to present ChromeOS as something a user might run on a secondary computer, thereby avoiding directly challenging Microsoft (we all know what happened to the last company who did that!)
  4. Apple will probably not be affected by this since it is not (yet) present at that end of the market. An Apple notebook is not something people looking to buy a netbook will consider.

Longer Search Strings

Posted in Search by François Schiettecatte on November 28, 2009

I have been involved in search for a while (on quite a while), and first heard about search string lengths at the 1993 SIGIR conference at Pittsburgh. I believe it was Doug Cutting who was presenting some data on that (my memory is hazy here) and how we were slowing getting up to 2 search terms per search. I think this was an ad-hoc presentation so I have no reference for that, but I know that Amanda Spinks has done a number of studies on search lengths (Google search).

Steve Arnold believes the current number to be around 2.5 which I think is on target, and the trend is up.

Steve points to this study on Hitwise which show those trends, though I would have liked more depth to the figures so we could really see the trend.

Which is, in fact, hard to see, because the change is so slowwww.

At Feedster we were seeing around 2 terms per search, and I put in code to boost the relevance of documents where terms were very close together (ie a phrase) to get better results.

Circumvallation

Posted in Scaling, Search by François Schiettecatte on November 28, 2009

I am have been reading Steve Arnold’s weblog on search, I have known Steve for over ten years now and he likes to challenge the status-quo and pushing people to see beyond the status-quo.

So it was very interesting to read his post about how Google is challenging Reed Elsevier and Thomson by indexing legal texts:

Google has added the full text of US federal cases and state cases. The coverage of the federal cases, district and appellate, is from 1924 to the present. US state cases cover 1950 to the present. Additional content will be added; for example, I have one source that suggested that the Commonwealth of Virginia Supreme Court will provide Google with CD ROMs of cases back to 1924. Google, according to this source, is talking with other sources of US legal information and may provide access to additional legal information as well.

His thesis is that the incumbents are like Vercingetorix stuck in Alesia (1) and that Google is like Ceasar who built two sets of wall around Alesia, one to keep the Gauls in, and the other to keep any relieving force out.

I like the analogy though it is not quite there, Google is not exactly laying siege and they don’t have to defend themselves. On the other hand the incumbents probably feel very much like the Gauls stuck in Alesia.

I was catching up with a long time friend earlier this week (a much smarter person than me), we were talking about lots of thing and one of those things was how particular species will move fluidly from one niche to another as they evolve. My feeling was that sometimes this happens in a fluid fashion without much struggle, but sometimes it can be quite violent resulting in the decimation of one or the other species (2). I wonder if this is closer to what is happening here. Google in moving in on an established market, though not in an explicitly deliberate fashion, and causing discomfort to the incumbents.

Now that a good portion of the data these incumbents charge for is available for free (it always was available for free, but access was difficult), it will likely force them to change their business model if they are to stay relevant. Steve makes that point very explicitly at he end of his post, for example:

Finally, what will be vulnerable to Google disruption will be difficult to use, expensive, and incomplete services. Maybe Reed Elsevier, Thomson Reuters, and Wolters Kluwer should merge. That will give the present crop of senior managers time to cash out. I don’t see an easy, quick, inexpensive, or painless way to prevent the lessons of Alesia being writ large in tomorrow’s digital headlines.

1 – I learned all about the siege of Alesia at when I was at school.

2 – For example, the introduction of Lionfish in the Caribbean is resulting in major population reduction in some indigenous species on the reefs.

Short Battery Life on the Magic Mouse

Posted in Apple by François Schiettecatte on November 27, 2009

I got a new Apple Magic Mouse on November 5th, and had to change the batteries in it today, after only 22 days of use.

Chrome OS

Posted in General, Software Development by François Schiettecatte on November 22, 2009

I have been snowed under with work lately and was only able to spend a little time playing with Google’s new Chrome OS.

I did find a very good review by Paul Thurrott though, well worth reading.

The Shark Swam Right Over Me

Posted in Scuba by François Schiettecatte on November 22, 2009

This pictures was not taken by me but by one of the crew on the boat I was on. I was taking pictures of some Nurse Sharks and some Caribbean Reef Sharks which had decided to come and check out the ‘attractor’ we had put out for them.

Interestingly the nurse sharks were the most aggressive ones with the ‘attractor’ trying to suck out the fish skin that was in there. Three nurse sharks had showed up, two smaller ones and a larger pregnant female. Somewhere along the dive I have gotten very close it as it was laying down next to a coral head. But in this part of the dive it was checking out the ‘attractor’.

I tend to get pretty close to sharks (much to the concerns of some) and this was no exception. The larger nurse shark turned away from the ‘attractor’ and swam right over me which was very cool. Unfortunately the picture is a little dark.

There are two other pictures worth checking out, here and here, there are both wide angle shots so the sharks actually look further away from me than they actually are.

While you are at it, you might want to check out my good friend JenFu’s underwater photographs, he recently took a trip to the Galapagos and to Cozumel.

Fedora 12

Posted in Software Development by François Schiettecatte on November 19, 2009

Just upgraded to Fedora 12, much nicer than Fedora 11.

And the upgrade process was smooth which was welcome.

InnoDB Compression

Posted in Scaling, Software Development by François Schiettecatte on November 19, 2009

I had a few hours to spare a couple of days ago and decided to check InnoDB Plugin 1.0’s support for data compression.

In a project I work on from time to time, there is a table which contains three blobs which contains text data. To store the data efficiently I was using the COMPRESS() function in MySQL and doing a “CONVERT(UNCOMPRESS(text) AS utf8)” to uncompress the data and present it as utf8. No problems there, but with the recent move to the InnoDB Plugin 1.0 in MySQL 5.1 there was an opportunity to push that down the stack.

I ran a few benchmarks and it turned out that using 8K pages was the optimal trade-off between space and time. Using 16K pages did not compress the data very well, and using pages smaller than 8K increased the time needed to store the data. I should note that 8K is also the default.

There are some interesting wrinkles in all this, innodb_file_per_table needs to be enabled and I think the innodb_file_format needs to be set to ‘barracuda’ thought I am not sure about that.

New Magic Mouse

Posted in Apple, User Interface Design by François Schiettecatte on November 12, 2009

So I got the new Apple Magic Mouse and I am finding it to be very nice. I also liked the previous Mighty Mouse a lot too, but I think I must have been one of the very few. I have small hands, have RSI and some arthritis, and for some reason the smaller mouse worked best for me.

The Magic Mouse works pretty well for me, though one could feel that the software is half finished and that the multi-touch surface could do a lot more than it actually does.

For some reason I could only use the wired version of the Mighty Mouse, I experienced a lot of lag and ‘float’ with the bluetooth version for some reason. The Magic Mouse has really tight tracking and no ‘float’ which is very nice. It has been a while since I used a wireless mouse that works this well in this respect.

Finally there is the small matter of power consumption, I bought mine on November 5th, it is now the 12th and the batteries are down to 70% already. That is 30% over a period of 7 days, which suggests that they will be dead in a month. These are the included batteries so presumably they were new.

I will post a follow-up on December 5th with more information as to battery consumption.

Apple Subscription Plan

Posted in Apple, Personal, Podcasts by François Schiettecatte on November 2, 2009

Apparently Apple is shopping a $30 month subscription plan to the media networks for content (1, 2).

I think this is a good idea.

Most of the content on the iTunes Movie store is buy-only which is not an attractive proposition since I suspect that most of it will only get watched once (or twice, maybe). The high cost of buying programs is what puts me off buying anything on the store. For example if I want to catch up on the Family Guy on the iTunes Movie store, I would need to buy all the seasons which would cost me between $10 and $30 a season, a show-stopper for me. For that kind of money I am better off going to Netflix but then I have to wait for 2 days. I know you can rent movies and I do once in a while but the selection is pretty thin.

So if I was able to have access to the content on the iTunes Movie store for $30/month and pay per movie rented, that would be an attractive proposition for me.

Attractive in the way cable wasn’t because I would get 70 channels for a basic cable subscription (67 of which I did not watch), and I would need to get premium subscriptions for better quality content, which is why I killed off my cable 4 years ago.

The interesting thing about this plan though is that it moves close to the a-la-carte programming that cable companies fought tooth and nail against time and time again.