. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

{ Tuesday, January 12, 2010 }

How "hard" should your startup's technology be?

When I hear about startups that have made some amazing technical innovation in NLP (natural language processing), OCR (optical character recognition), image recognition, speech recognition, I am somewhat skeptical. The government, especially DARPA, many university research departments, Xerox PARC (and Interval Research, where I worked briefly in 2000) have sunk a lot of time and money into this kind of research, in some cases billions of dollars, and a lot of this research not only available but free to all comers. Improving on them is a vast and difficult task, and even a 5% improvement on the existing state-of-the-art is invisible to the end user - the innovation has to be an order of magnitude better to 'feel' better.

There were dozens of startups that approached us at Flickr with tech that could do fantastic things with image recognition software. But we were pretty good at getting people to tag photos with what were in them. The image recognition software could tell us if a photo was a photo of Mom, but not if Mom was smiling, or looking puzzled, or dancing on her 50th anniversary. People adding tags, descriptions, titles and comments turned out to be better sources of metadata for those photos, and that metadata could then be used in interesting ways.

Some easier, faster, less expensive and I think, more satisfying ways for startups to innovate using these technologies are:

1. Find a new way to use the tech that serves a pressing consumer need (this is hard)
2. Find a new source of data (the original PageRank algorithm's use of links as its primary data source)
3. Create a new source of data and apply the tech to it (in the example above, Flickr using "social engineering" to create a vast amount of human-added metadata)

To have credible tech, it's a bit of a Goldilocks problem. If you build something too easy, it's not defensible, and you'll be easily copied. Too hard, and it's a job that requires government research or academia. Startups whose strength is technical should try to something of middling technical difficulty. Some startups, such as Twitter or Daily Booth, can be successful without a lot of tech. Nothing wrong with that! But I see startups all the time making tech claims that I find dubious.

Entrepreneurs can build on work that's come before to solve lots of interesting problems. So, thank you Berkeley, Stanford, et al!

LINK | 6:20 PM | TB

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

  { COMMENTS }

Caterina,

So what do you think of the state of the art in automated face detection and face recognition? Has it gotten good enough yet to be usable commercially? Or is it still something that isn't quite there yet...

Dave

David Sifry | January 12, 2010 9:09 PM

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Funny, I was thinking about exactly that tech as I was writing that post. I generally think the kinds of tech I mention here get you 75% of the way towards solving the problem, but the last 25% has to be done by humans. You'd never sign a legal document translated by OCR or by the various online language translators.

Similarly you wouldn't trust face-recognition tech for national security. You could use it to, say, have a lifeguard kick a kid out of the adult swim, but nothing more serious. So, not there yet.

Caterina | January 12, 2010 9:17 PM

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

So inspiring post!I was just passing by this site and came across this post.Its really interesting and the ideas are so innovative.You have done superb job.

carte memoire | January 13, 2010 3:21 AM

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Hi Caterina, interesting post.

I agree that many of the innovations touted by some startups are at best incremental, and that they frequently apply to areas that are very, very, hard (eg machine understanding in general).

That said, I think there is a lot of scope for what I would call "modest" technologies, which aim to add value using existing tech in a novel way. Hunch is a good example.

David Semeria | January 13, 2010 5:42 AM

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Yes, I'd hope that Hunch was a good example. Thanks David!

Caterina | January 13, 2010 7:23 AM

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Totally agree with finding and honing in on the right balance... I find it very difficult to gauge that proper balance by catering to those who want to enjoy new "tech" but have to be handheld every step of the way and then those that are bothered by such hand-holding measures... Like the brain, it seems most only use 10% or less of the available features of a startups innovative technology:-)

Norm | January 13, 2010 8:12 AM

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Thanks for the post Caterina. I appreciate your observations.

Your comments on face recognition relate to a conversation a friend and I were having at lunch yesterday.

There are times when you are watching a movie or a TV show and want to know who an actor is. It would be great to be able to take a picture of the actor and find out who it is -- kind of like a photo-driven IMDB.

Does this capability exist yet? If not, maybe you or one of the people who read this post can get started. It's at least worth a $1.99 iPhone App for me ;-)

Larry Davis | January 13, 2010 8:17 AM

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Couldn't agree more. We tend to think in algorithmic solutions because that's what we're used to. Using the Crowd as a global computer is so new a concept that startups still think that getting acknowledged requires something with super-advanced math.

I've been in the field of digital audio watermarking for years. That's pretty cutting-edge. But in the end nobody wanted our product. Least did the end users who's content it was supposed to be tracing.

So yeah, innovation doesn't necessarily mean rocket science. It's more like putting simple parts together in unusual ways.

Dan
@DanielStocker

Dan Stocker | January 13, 2010 8:31 AM

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

I think there's a gap between the academic researchers and industry, where relatively few researchers write stuff that scales or release usable code. Being the first team to have write a usable implementation of some fancy algorithm from academia can be a significant boost, especially given that most of the people that read academic papers are going to be other researchers, not competing start-ups.

Aaron | January 13, 2010 8:43 AM

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

I think a distinction needs to be made between science and engineering here. I'd argue that startups shouldn't bother researching new science for many of the same reasons you mention here, plus the fact that researching new science can be extremely risky and better suited to universities and labs with the proper capital risk profile.

Engineering is taking the advancements in math and science and applying them to building a product. It's something that can only be done in industry and its great to aim for "hard" engineering problems that have a high payoff if you have the technical strength. Solving "hard" engineering problems is a huge barrier to entry because it can require understanding very complex science (and combinations of science that have never been attempted) to a level where you can use it to solve real problems.

Matt Gattis | January 13, 2010 2:49 PM

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Yeah, there's also a bunch of entrepreneurs who take university research and convert them into businesses -- Eric Paley, my confrere from Founder Collective took some MIT research and built it into Brontes Technologies -- dental technology. Another avenue.

Caterina | January 13, 2010 3:06 PM

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

I think you're spot on here. I've been in the technology world a long time and I've heard a lot of claims of "revolutionary innovations." For the most part, they're not revolutionary and, of the part that remains, few are feasible/defensible business propositions... meaning they're not really innovations.

My own personal example: I worked with a startup that was going to use web crawling and fancy Bayesian AI to create market analyst research reports. It failed for numerous reasons, not least of which was that you could hire humans to mine the same data... and humans are more trainable and cost fewer dollars in HW and ISP charges than a fancy Bayesian AI machine!

Dan Greenberg | January 13, 2010 6:15 PM

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Great post. One comment, I have entered a few tech heavy industries like Speech Recognition and have been successful by using human horse power to overcome the initial need for crazy tech. Doing this gives you speed to market, ability to get real market feedback and breathing room to build your tech over time.

Jamie Siminoff | January 14, 2010 10:10 AM

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

I agree with this post wholeheartedly. At Tellme we built a >$100 million/year business doing speech recognition, but we licensed all the tech off the shelf. Sure, we did a lot of things to make it better, but for the core recognizer, we licensed it from folks like Nuance or IBM who had massive R&D budgets and had sunk huge costs into building the core tech. The funny thing is, the core tech was not very usable out of the box -- all the art in making that stuff actually work in a real application was where most of our value was added.

A related item is how often "pure tech" startups end up becoming small (from a shareholder value standpoint) companies. If they don't flame out, they get gobbled up by a big Microsoft / Apple /etc. type of company for unimpressive amounts. Examples include Fingerworks, acquired by Apple to do things with touch, and Seadragon, acquired by Microsoft (and their tech is pretty amazing, one must admit). Neither was a large outcome from a shareholder standpoint, but both have technology that has the potential to impact millions of consumers in profound ways and create large commercial opportunities, if it can be harnessed properly by the new corporate parents. However, neither would have been a large successful company (from a revenue standpoint) on their own.

Angus Davis | January 14, 2010 3:26 PM

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Caterina,

Very interesting post. I am curious specifically what you think of innovation in speech recognition and transcription technology. It seems every other week I read some article about how adoption is expected to rapidly increase in hospitals saving doctors time and hospitals money on transcription costs. This technology has been around for a long time though and adoption still seems very limited. This seems consistent with your comment that innovation needs to be an order of magnitude better as opposed to small improvements in accuracy before the technology will be accepted for widespread use. Do you think that sort of innovation is likely / possible given where the technology is today? Any thoughts?

Christof Pfeiffer | January 17, 2010 9:52 AM

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

{ Post a comment }
















. . . . . . . . . . . . . . . . . . . . . . . . .