Saturday, 29 December 2007

The Semantic Web

I'm interested in how humans attempt to extract value from the flood of information they create (search for my earlier posts on "Myth, Media and Meta"), devising more and more effective ways of extracting this value.  My candidate for the next big thing in this regard is what's labeled the Semantic Web, which the anonymous authors of Wikipedia describe as:

... an evolving extension of the World Wide Web in which web content can be expressed not only in natural language, but also in a format that can be read and used by software agents, thus permitting them to find, share and integrate information more easily.[1] It derives from W3C director Sir Tim Berners-Lee's vision of the Web as a universal medium for data, information, and knowledge exchange. ...

Link:  Wikipedia.

In a 2001 article in Scientific American, Tim Berners-Lee, James Hendler and Ora Lassila described it in some detail.  The article, The Semantic Web, is still freely available on its web site.

The December 2007 issue of the same magazine has very good update on this effort, titled The Semantic Web In Action (authors: Lee Feigenbaum, Ivan Herman, Tonya Hongsermeier, Eric Neumann and Susie Stephens).  Unfortunately, the Scientific American web site currently has it available for paid access only and that payment is greater than the cost of the magazine, so look for it on newstands before it disappears.  Highly recommended.

Also see Nigel Shadbolt, Wendy Hall and Tim Berners-Lee, The Semantic Web Revisited, IEEE Intellilgent Systems, May/June 2006 and, for the technically inclined, the World Wide Web Consortium's W3C Semantic Web Activity page.  --Dennis

PBCore metadata standard

Pbcore The PBCore metadata standard is, IMHO, one of the most important and, arguably, the most progressive thing the Corporation for Public Broadcasting has ever funded.  It's a necessary tool to enable public media to transition to the digital age -- assuming, of course, that public media's managers can find their way to the digital age. 

Marcia Brooks is directing the project from the Carl and Ruth Shapiro Family National Center for Accessible Media at WGBH in Boston and she's written a great article on it for the December 17th issue Current, the public broadcasting newspaper.  Current seems to delay posting its articles online for some reason, so the the Public Television Digital Archive has posted a copy of Get going in metadata today with this amazing free kit! online [PDF].  Nice job, Marcia, and thanks to Nan Rubin for posting this important overview.

Update 29 December 2007:  I see that the PDF has also been posted at the PBCore web site.

Update, 4 January 2008:
An HTML version is now posted at Current.  --Dennis

Sunday, 09 December 2007

RoboWheat

Robowheat_3small My son Andrew is a senior industrial design major at the Rhode Island School of Design.  He's taking several digital media courses this semester and as a final project he designed a "robotic wheat field" inspired by the real wheat fields that surround the woods where our home is located.  The "wheat" in this case are 28 stiff vertical wires mounted seven at a time on each of four boards on the floor.  As you walk through the "field" and move your arms, animated light streams from the ceiling in the form of projected RSS feeds (the feed he set up for the show was a Technorati Robowheat1_3search for all posts relating to "sunshine").   The light follows your movement through the field.  The device is currently in its second edition - you can check it out on his site and on YouTube.  Click on the images to the right for larger versions.  More information and pictures here and here.   Very clever, says Dad.  --Dennis

Saturday, 08 December 2007

Watching What You See on the Web

Big_brother_theater_4You better not pout,
Better not cry,
Better be good,
I'm telling you why:
Santa Claus is coming to town!

He's making a list,
Checking it twice,
Gonna find out who's naughty or nice.
Santa Clause is coming to town!

He knows when you've been sleeping,
He knows when you're awake.
He knows when you've been good or bad,
So be good for goodness sake!

[Chris Isaak: "Santa Claus Is Coming To Town"]

Back in February of 2006 I wrote a short essay here called The "new Hoover" and attention metadata about privacy concerns relating to our ability to collect metadata about our Web browsing.  Usually, these metdata enhance our Web experiences as in the case of a site like Amazon which gets smarter about our preferences as we revisit the site.  But this technology has also been used for government snooping.  Again, I'm sure most of us don't mind when that's used against child porn distributors or real terrorists, but on the principle that "if it can be done, it will be done," it doesn't take too great a dose of paranoia to imagine less beneficial uses.

Now comes Bobby White of the Wall Street Journal with a distrubing story about companies (NebuAd, FrontPorch and Phorm are mentioned) that are supplying technology to Internet Service Providers to monitor your surfing and target ads to you based on the metadata it collects.  Since your ISPs already have your name, address, phone number and payment information, it's a small step to associating you with the metadata.  White writes:

... This technique -- called behavioral targeting -- is far more customized than the current method of selling ads online. Today, it's an imperfect process: companies such as Revenue Science Inc. and Tacoda Inc., which was recently bought by Time Warner Inc., contract with Web sites to monitor which consumers visit them, attaching "cookies," or small pieces of tracking data, to visitors' hard drives so they are recognized when they return. The targeting firms feed the data to Web site owners, who use it to charge premium rates for customized ads. But the information is limited, since the tracking companies can't monitor all of the sites an individual visits.  ¶  The newer form of behavioral targeting involves placing gear called "deep-packet inspection boxes" inside an Internet provider's network of pipes and wires. Instead of observing only a select number of Web sites, these boxes can track all of the sites a consumer visits, and deliver far more detailed information to potential advertisers. ...

Link:  Wall Street Journal.

According to the article, NebuAd says it "doesn't track traffic to sites related to sex, health or politics."  Sure.  How does it know you're headed there unless it's tracking some aspect of that?  [See clarification from NebuAd CEO in Comments section below.  It does sound like that company is making a good faith effort to protect privacy, but I feel that the following paragraph is still generally valid.  --Dennis]

All this stuff is subject to subpoena and press scrutiny.  Or, perhaps under the Patriot Act that's not even necessary.  Ask a librarian.  All this is extremely troubling and goes to the core of the value of information exchange to us all.  These ISPs and technology companies want us to trust that they're not abusing this  (the article notes that some ISPs are permitting consumers to opt out).  OK, but we've gone far enough down the road of constitutional erosion in this country that it's not them I'm worried about.  --Dennis

Continue reading "Watching What You See on the Web" »

Saturday, 01 December 2007

Facebook facing criticism over usage information

Catherine Holahan writes:

Facebook made modifications to a controversial advertising system that many users considered an invasion of privacy. On Nov. 29, the social network gave users greater control over the tool, known as Beacon, which broadcasts what they do on partner sites to other Facebook users. ...

... The Beacon brouhaha underscores the dilemma faced by social networks and other Internet companies such as Google (GOOG) attempting to generate revenue from the wealth of information they collect about users. Letting advertisers capitalize on such information helps Web companies collect sales and grow. But sharing too much data can result in the kind of backlash encountered by Facebook.  ...

Link:  BusinessWeek.

No one has been more critical than Umair Haque.  Here are five recent posts:

Edge Principles: Love > Indifference.  Link:  Bubblegeneration.

Fixing Facebook.  Link: Bubblegeneration.

Beacon 2.0: How Not to Think Strategically About... Link: Bubblegeneration.

Industry Note: More Evil Than Evil.  Link: Bubblegeneration.

Research Note: Google vs Beacon, Or Why Advantage is in the DNA.  Link: Bubblegeneration.

Brad Stone has an interview with Facebook about all this.  Facebook Executive Discusses Beacon Brouhaha.  Link:  New York Times.

Also see Louise Story and Brad Stone, Facebook Retreats on Online Tracking.  Link: New York Times.

Update 2 December 2007:
Om Malik has the background on this.  See To Save Its Bacon, Facebook Weakens Beacon.  Link: GigaOM. --Dennis

Thursday, 29 November 2007

Myth, Media and Meta podcast

As I noted in the previous post below, audio files and presentation decks from the 2007 Iowa DTV Symposium have now been posted online.  This includes my own presentation from Oct. 2nd (and intimidating it was to be presenting the same afternoon as John C. Dvorak and Mark Schubin) titled Myth, Media and Meta: Three Information Epochs and What They Mean for Broadcasting.  Here is the agenda description:

Humans have always created information faster than we create humans. And, consequently, humans are in a constant struggle to extract value from the "noise" of too much information in their environment. The title refers to epochs that are characterized by the techniques we've used. "Myth" being story-telling, poetry, music, etc. "Media" dates from Gutenberg and encompasses traditional broadcasting. "Meta" is the digital age, characterized by use of metadata, compression, "pull" distribution, and distribution systems that can learn user preferences. Broadcasters use all these techniques and social systems may enable them to be successful in "many-to-many" distribution in the future.

Links:  MP3, PPT or for Flash version of the deck click here and scroll down in the Content Track to 4:30 on Tuesday.  Also, I've placed both the MP3 and PPT links permanently in the Files section to the left.

For a written version, see these four posts from early June.  Reboot Redux: Myth, Media and Meta; Part 1, Part 2, Part 3, Part 4.   --Dennis

Sunday, 28 October 2007

First Monday has moved

The excellent online peer-reviewed journal about the Internet, First Monday, has moved from www.firstmonday.org to Open Journal Systems at the University if Illinois at Chicago.  Its new (unmemorizable) URL is www.uic.edu/htbin/cgiwrap/bin/ojs/index.php/fm/ and it now requires a free registration.  Its archives have been moved here also.  In a big step forward, it now has RSS and Atom feeds, so I've subscribed in Bloglines and will be seeing new things as they're posted rather than every few months when I happen to remember I haven't visited the site in awhile. 

From the September 2007 number, check out Desperately seeking the consumer: Personalized search engines and the commercial exploitation of user data by Theo Röhle.  Here's the abstract:

With reference to surveillance studies theory, this paper critically assesses the role of personalized search engines as a mediator between advertisers and users. It first sketches the economic and technical background of online marketing and personalized searches. Then, it engages in an in–depth discussion of two examples of personalized search engines with regard to the data collection process used and the way in which this data is used for advertising purposes. The discussion shows that users’ information needs, as well as their personal data, are subject to a growing pressure in terms of commercial exploitation. Essentially, search engines now fulfill the task of translating information needs into consumption needs.

Link:  First Monday.  --Dennis

Friday, 26 October 2007

Vin Crosbie's giving away consulting advice

Consultant and blogger (Digital Deliverance) Vin Crosbie will be resident this academic year as an adjunct professor at the S. I. Newhouse School of Public Communications at Syracuse University.  He's posting several essays online "giving away consulting advice," four so far.  The first was about immediacy, the second about multimedia, the third about podcasting and vodcasting, and the fourth is about interactivity.  He mapped out six others, but haven't seen any more posted since mid-September.  These are well worth your time.  --Dennis

Recommending the tail

Check out Jeff Jarvis's post of this title about social distribution and recommendation and how they work as business drivers.  Be sure to follow the two links with which he leads off the post and check out the reports he's discussing.  Link:  BuzzMachine.  --Dennis

Thursday, 04 October 2007

Myth, Media, Meta: Three Information Epochs and What They Mean For Broadcasting

I was happy (though I had to follow two of my tech heroes, John C. Dvorak and Mark Schubin) to give a presentation again this year as part of the Iowa DTV Symposium, a national event held annually in Des Moines and organized by Dan Miller's great staff at Iowa Public Television.  My topic was the title of this post and attempts to use information theory to find a middle ground between legacy media and new media, the former group too often suffering from hubris and the latter often characterized by naïvté.  I made a preliminary series of four posts on this topic back in early June (here's part I and you can get to the other three from it).  My wife the professor and I are writing a book expanding on the topic.  Stay tuned.

I think they'll be posting audio to their web site, but for now, you can look at my PowerPoint deck which I've posted in the Files area on the left of this blog's main page.  Here's the direct link.  --Dennis

Sunday, 29 July 2007

Battle for the Future of the Net

We humans create information faster than we create ourselves.  Arguably, the advance of technology has been the effort to extract value from this information -- to distribute it more widely, to stuff more of it into pipes and storage devices, and most importantly to find it when we need it.  The "Semantic Web" is the the best candidate for the next epochal change in this "arms race." Jennifer L. Schenker has two good articles on it (the second is the "Taming..." link at the end of this quote):

... Now, as the next generation of Internet technology edges toward the market, European companies and policymakers are determined not to suffer the same fate. In a bid to get ahead of U.S. researchers, they are underwriting research into the so-called Semantic Web—also sometimes called Web 3.0—to the tune of hundreds of millions of euros. "The U.S. and Europe are competing on funding something that could have an extraordinary strategic impact," says Whit Andrews, a research vice-president at technology consultancy Gartner (IT).  ¶  At stake is nothing less than the future of the Net. Developed in part by Berners-Lee, who is now based at MIT, the Semantic Web goes well beyond today's relatively static information highway to add richer media and support for vast pools of unstructured data—in effect, making all the world's knowledge available online. It also connects the information in ways that will let users discover novel associations among unrelated data. That has big implications for fields ranging from the military to medical research to business intelligence (see BusinessWeek.com, 4/9/07, "Taming the World Wide Web"). ...

Link:  BusinessWeek.

See also the Semantic Web article in Wikipedia.  --Dennis

Friday, 27 July 2007

Google to offer Internet portal for Sprint's WiMAX network

I've posted recently about the Clearwire and Sprint WiMAX partnership because I think it's a signal that the broadcasting and consumer electronics industries need to get ready for WiMAX-to-the-dashboard.  Jacqui Cheng gives us another signal with a story about Google coming to the party.  The story ends:

... Google's current collaboration with Sprint will provide a boost as Sprint and its WiMAX partner, Clearwire, try to raise the profile of WiMAX. The wireless broadband technology has had a hard time getting off the ground in the US, but Sprint plans to roll the high-speed wireless tech out in Chicago, Baltimore, and Washington D.C. by the end of the year, and 17 more US cities by April of 2008. The company says that it hopes to reach 100 million people by the end of 2008.

Link:  Ars Technica.

Also see Cheng's 2008 WiMAX rollout scheduled for Chicago, Indy, Denver, and more.  Link:  Ars Technica.  --Dennis

Saturday, 21 July 2007

Is Web 2.0 a Manifesto for Anarchism?

Bomb David Jennings has a interesting review of a fascinatingly prescient 1973 book by anarchist Colin Ward.  Anarchists have always been symbolized by the round bomb and seen as short on theory.  But read this sample:

... Anarchy in Action is no call to guerilla direct action to undermine the state apparatus. But it was both radical for its time, and prescient. "Anarchists are people who make a social and political philosophy out of the natural and spontaneous tendency of humans to associate together for their mutual benefit," writes Ward near the start of the book. He goes on: "we have to build networks instead of pyramids." So are we all anarchists now, and what does it mean to be an anarchist in the era of Web 2.0? I read this book because I had a hunch that there was a common thread running through old theories and current practice, and I wanted to see how strong this thread might be. ...

Link:  D J Alchemi.  --Dennis

Saturday, 07 July 2007

David Weinberger: Everything is Miscellaneous

Phil Windley and colleagues have an interview in the Technometria podcast series with David Weinberger, author of Everything is Miscellaneous.  They describe it as follows:

In Everything Is Miscellaneous, David Weinberger charts the new principles of digital order that are remaking business, education, politics, science, and culture. In his rollicking tour of the rise of the miscellaneous, he examines a number of topics to illustrate this change.  He joins Phil, Scott, and Ben to discuss the book.  ¶  He first discusses his background, as well as his previous books (The Cluetrain Manifesto and Small Pieces Loosely Joined). He assesses the new methods of organization and how they are changing how information is used.  David also reviews how the changes affect the gatekeepers and what the future might hold for them. He shows how by "going miscellaneous," anyone can reap rewards from the deluge of information in modern work and life.

Link: ITConversations.com.

Thursday, 05 July 2007

Virtual Worlds as Social-Science Labs

If you read the recent post below on Second Earth, you should also be interested in Andrea L. Foster's article  on the work Professor Edward Castronova of Indiana University at Bloomington in the area of virtual world applications.  Link:  Chronicle of Higher Education.  --Dennis

Tuesday, 03 July 2007

Second Earth

There's a terrific article with this title by Wade Roush in the July/August issue of Technology Review which describes a mind-expanding but imagined mash-up between (e.g.) Second Life and (e.g.) Google Earth -- to something called a metaverse.  I've long thought that Second Life and its cousins were a natural place for visual storytelling, and particularly for educational applications.  This makes it irresistible.

Link:  Technology Review (free registration required).  Related video (no registration needed).

Update 5 July 2007:
Also see Andrea L. Foster's, Virtual Worlds as Social-Science Labs.  Link:  Chronicle of Higher Education.  --Dennis

Saturday, 30 June 2007

Gorman and Shirky on authority vs. openness

The Britannica Blog has hosted a very interesting exchange of essays between librarian Michael Gorman (ex-Dean of Library Services at California State University, Fresno) and Clay Shirky, teacher (New York University's Interactive Telecommunications Program), writer and consultant.  I've been reading Shirky for years.  It's the openness vs. authority argument; Wikipedia vs. Encyclopædia Britannica (though of course, not just that); or, in book length, David Weinberger's Everything Is Miscellaneous: The Power of the New Digital Disorder (reviews) vs. Andrew Keen's The Cult of the Amateur: How today's Internet is killing our culture (review).

Gorman begins with The Sleep of Reason, Part I (11 June) and Part II (12 June).

To which Shirky responds with "Old Revolutions Good; New Revolutions Bad" (14 June).

Gorman continues with The Siren Song of the Internet, Part I (18 June) and Part II (19 June).

To which Shirky responds with The Siren Song of Luddism (19 June).

Gorman continues again with Jabberwiki: The Educational Response, Part I (25 June) and Part II (26 June).

The Britannica Blog follows up with other responses:  Publisher Roger Kimball (The New Criterion, Encounter Books) also contributes to this debate in Technology, Temptation, and Virtual Reality.  Doctoral candidate (U. C.-Berkeley) danah boyd in Knowledge Access as a Public Good (27 June).  Reference librarian Thomas Mann in Brave New (Digital) World, Part I: Return of the Avant-Garde and Part II: Foolishness 2.0?.

I've not gotten through the essays in the last paragraph yet, but so far it's good reading.  --Dennis

Friday, 01 June 2007

Reboot Redux: Myth, Media and Meta (part 4 of 4)

Meta

This is the last of four parts of a talk originally scheduled this week at Reboot 9.0 in Copenhagen.  Links to the others follow this post.

With the rise of mass media, information flow quickly exceeded the attention capacity of humans.  Information has value to many (e.g., American Idol) or to a few (e.g., this weblog), but if you don’t have the attention or the interest or the need for that information, it’s really high-entropy “crud,” not information at all.  So now we have a subjective definition of information.

Only among humans has there been a power curve of information growth such that managing a surplus of information becomes a problem for our species – maybe the central problem.  Or, to quote a popular version of science fiction writer Theodore Sturgeon’s Law, “Ninety percent of everything is crud.”  Of course, each of us has a different standard for the useful 10 percent – hence, the secret of Chris Anderson’s long tail meme describing the power law distribution of content popularity.

Metadata_2 In response, in the recent human past, we have developed information about information – metadata (e.g., the library card) – and very recently we’ve developed a powerful peered network – the Web.  We’re learning now how to dynamically add metadata to nodes on the Web to make them more valuable with each usage.  These techniques are often subsumed under the Web 2.0 label, but they date at least from the start of Amazon.com.  This is the “meta” epoch of information – the “selfish Web” where each user can find low-entropy value.  And, in contrast to the media epoch, both dissemination and origination are democratized.

Is it our ability to individualize information – to live within multiple communities of interest – that makes us human today?  When this, too, exceeds our attention limits, what will be the next means of extracting value from disorder?

See also part one (intro), part two (myth), and part three (media).  --Dennis

Reboot Redux: Myth, Media and Meta (part 3 of 4)

Media

This is the third of four parts of a talk originally scheduled this week at Reboot 9.0 in Copenhagen.  Links to the others follow this post.

The problem is, “myth” by itself doesn’t scale.  When humans spent most of their time in family or other social groups, this did not matter much.  But as humans began migrating and congregating in larger populations, the transmission of myth became more difficult, and conflicting myths began contending with each other for primacy (e.g., the Crusades).Gutenberg_2

Writing is a medium and has a long history, but for our purposes I really want to consider the impact of mass media.  And, again for our purposes, I’d like to date that from the mid-15th century invention of movable type printing in Europe by Johannes Gutenberg (by then it had existed for 400 years in China, but its use with the more limited European alphabet made it successful).  Instead of one copy of the Christian Bible per scribe per year, humans now could reproduce information at some scale.

Not only books but polemics like the Disputation of Martin Luther on the Power and Efficacy of Indulgences (1517) got wide distribution.  In only 80-100 years after its European invention, movable type resulted in a flip to Protestant Christianity for much of the continent.  In the modern world, the “power of myth” (to borrow from the title of the classic Bill Moyers PBS series with Joseph Campbell), is the power of the mass media.

Of course, mass media also power other information as well.  Science and democracy have flourished with wide dissemination, but so have totalitarian ideologies and pornography.

Arguably, mass media have enabled content creation on a scale and of a quality not possible without it.  It’s doubtful, for example, that Halldór Laxness would have written his amazing novels (see the reference in part two) if his distribution was initially limited to a single copy on vellum.  But then that’s exactly what his fellow Icelander, Snorri Sturluson, author of the Prose Edda and other works did some seven centuries earlier.

If ideas were beginning to bump into each other before the invention of movable type, their conflicts became orders of magnitude greater afterward.  What hadn’t been replaced from pre-media days was the role of the information disseminator – though one-to-a-few became one-to-many.  Distribution was democratized but origination was still in the hands of the few.

Is it our ability to scale information through mass media that makes us human?

More about this in the last post in this series (part one was an introduction, part two covered the role of myth, and part four will conclude with what I’m labeling “meta”).  --Dennis

Reboot Redux: Myth, Media and Meta (part 2 of 4)

Myth

This is the second of four parts of a talk originally scheduled this week at Reboot 9.0 in Copenhagen.  Links to the others follow this post.

Myth – which I’m using here very broadly – is the most persistent method of lowering entropy, originating from the necessities of passing on information by oral means, and continuing to the present day – even in educated Western societies.  Poetry, song and legend are the building blocks of myth.  Easy to remember, crafted in metaphor, they have served through millennia to transmit culture, religion, history, law, social mores, and pre-scientific explanations of natural phenomena.

I’ll pick an example from Norse mythology.  My maternal grandmother was born on the island in Denmark which is home to the city of Odense.  Its name comes from Norse words meaning “Odin’s sanctuary.”  My surname comes from the family farm in mid-Norway.  It has several ancient stone-pile graves and the name is likely a kenning – “Hár’s acre” – “The High One’s field,” from one of Odin’s many nicknames – for this field of graves.

300pxodin_hrafnar Odin was effectively the Nordic/Germanic god of information –credited for eloquence, poetry, music, wisdom, magic, prophecy and inventor of the runes – writing.  His myth has it that he pawned one of his eyes in exchange for wisdom.  Each morning, he sent his two ravens, Hugin and Munin – thought and memory, respectively – to fly the world bringing back what they had seen and heard.

One of the poems in the Poetic Edda, preserved in Iceland in the 13th century from the oral tradition, is Hávamál, The Sayings of Hár where Odin provides advice for wise living.  Examples below from the translation by W. H. Auden and P. B. Taylor:

No. 27
The ignorant booby had best be silent
When he moves among other men,
No one will know what a nit-wit he is
Until he begins to talk;
No one knows less what a nit-wit he is
Than the man who talks too much.

No. 58
Early shall he rise who has designs
On another’s land or life:
His prey escapes the prone wolf,
The sleeper is seldom victorious.

Pretty benign stuff, but myth derives strength by being close to power – the shaman or priest or official or family head.  Even today, otherwise sober people are demanding that myths be substituted for science in the education system of scientifically sophisticated countries.  Tragically, deaths from stoning or decapitation are still happening in other parts of the world when the codes that myths engender are violated.

But myth can also be sublime.  It would be hard to convince me that any form of information provides more “value per bit” to humans than literature, poetry and music.  To me, the music of Johannes Brahms and the novels of Halldór Laxness touch my soul – to use a very unscientific word – like nothing else.  From the latter’s masterpiece, Independent People, these words about the principal character’s step-daughter:

When a man looks at a flowering plant growing slender and helpless up in the wilderness among a hundred thousand stones, and he has found this plant only by chance, then he asks:  Why is it that life is always trying to burst forth?  Should one pull up this plant and use it to clean one's pipe?  No, for this plant also broods over the limitation and the unlimitation of all life, and lives in the love of the good beyond these hundred thousand stones, like you and me; water it with care, but do not uproot it, maybe it is little Ásta Sóllilja.

The essence of his subtle and complex novel is captured in this one paragraph in a way that a similar number of words of, say, business writing, cannot replicate.

Is it our ability to create myth that makes us human?  Myth hasn’t been replaced by media and “meta.”  To the contrary, myth was an early adopter of, and has arguably thrived under, each.

More about this in part three on media (part one was an introduction, part three deals with media, and part four will conclude with what I’m labeling “meta”).  --Dennis

Reboot Redux: Myth, Media and Meta (part 1 of 4)

Introduction

The Reboot 9.0 conference in Copenhagen that ended today had as its theme the word, “human” – that simple and that complicated.  Until last week, I planned to attend and to do a talk on what one might call a human information theory of value.  The following from my notes for that talk takes a look at three successively-layered epochs of human information dissemination, which I’m labeling “myth,” “media,” and “meta.”

Note:  This is the first of four parts.  Links to the other three parts come at the end of this post.

In 2000, Michael Lesk, now a professor at Rutgers University, estimated that there were 12 exabytes (12 x 1018 bytes) of recorded information in the world and that it was increasing at 4 exabytes per year.  Another study at the University of California at Berkeley found that 5 exabytes of recorded information was produced in 2002 alone.  Five exabytes equals 37,000 libraries the size of the Library of Congress’s book collection.  That would put the aggregate amount of recorded information in 2007 in the 45-exabyte range – 6.8 gigabytes (6.8 x 109 bytes) of recorded information for each of the 6.6 billion persons on the planet.

Roll back the calendar 2,000 years and the world’s population was the same as that of the U.S. today – 300 million.  Information then was largely recorded on scrolls.  The largest library of that era was the Royal Library at Alexandria, Egypt, a collection variously given as 400,000-600,000 scrolls at its peak.  I don’t know how much information was on each scroll, but let’s assume that each one was the equivalent of 100 typewritten pages – 200 kilobytes of information each – 100 gigabytes total.  This was a very comprehensive library, but let’s figure that it missed 90% of the world’s recorded information.  So a rough guess, then, that is the aggregate was one terabyte (1 x 1012) – just over 3 kilobytes (300 words) of recorded information for each person on the planet.  In 2,000 years, population has grown (mostly in the past century) 22-fold, but recorded information per person has grown some 2 million times!

Of course recorded information is not the only information in the world.  The information coded in Earth’s physical and biological structures no doubt dwarfs recorded information.  Atoms assemble from atomic subparticles and likewise molecules assemble from atoms through information.  Information also determines the development path of each organism and organisms make use of information to survive.  But these information systems are stable enough that one can treat them as effectively closed systems.

Entropywarped_small The late comic Buddy Hackett affected a bumbling on-stage demeanor and was once invited to replace Curly Howard in the Three Stooges, but his humor sometimes had an intellectual foundation.  My favorite was a classic explanation of entropy – a measure of disorder in a closed system.  While staring intensely at a glass of water, Hackett was asked what he was doing.  Hackett replied, “Someone told me that I could bring a glass of ice water to a boil just by staring at it.  I’ve already got it up to room temperature.”  The Second Law of Thermodynamics describes how the temperature of both ice water and boiling water tend to reach an equilibrium over time – in this case to room temperature – through increasing entropy.

Two thousand years back, an individual could get the equivalent of his or her share of aggregate recorded information – 300 words – in conversation at breakfast.  Today, an individual’s exposure to his or her 6.8-gigabyte share takes considerably longer, even with the New York Times next to the plate and the Today show playing in high definition in the background.  Entropy is still inexorable, but for humans, the “information room temperature” is increasing as we invent ever-more sophisticated ways of extracting information value from the disorder around them.

Part two takes a look at the first of these inventions – the role of myth in human extraction of value from information.  Part three will look at the role of media and part four will look at what I’m labeling “meta” – information about information in a networked world.  --Dennis

Powered by Qumana

Thursday, 19 April 2007

Search and Co-Opt

Michael Elins writes:

... The other approach, conceived by PodZinger, a video-search startup in Cambridge, Massachusetts, is this: Co-opt the pirates. Unleash them to spread your media virally, and let PodZinger track viewership--and kick back ad revenue. That makes "piracy" profitable to the copyright holder.  ¶  At the core of PodZinger's proposed solution is video search, a problem it has largely cracked. ...

Link:  Fast Company.  Thanks to Chuck McConnell for the tip.  Link added.  --Dennis

Tuesday, 03 April 2007

What is video search?

Andrew Keen moderated a panel on this topic at the iHollywoodForum Digital Media Summit.  He's posted a video of the session on his vlog.  Link:  AfterTV.  Interesting panel, worth suffering through the bad audio.  --Dennis

Sunday, 01 April 2007

Can Social tools save plain ole’ Radio?

Rags Gupta writes:

... This spells opportunity for terrestrial radio. They can use their clout to negotiate on-demand and Webcasting rates to offer these services online and they have the brands, distribution and ad sales resources to be able to optimize the monetization of these services. They should also consider buying some of these music social networking companies and weaving the service into their existing online sites. On the flip side, the labels and music publishers should consider crafting voluntary, standardized licenses for social networks to be able to enable their users to share and listen to music off their profile pages on-demand. After all, they want to get paid for the use of their music and the easier they make this, the more licensees they’ll have. ...

Link:  GigaOM.

Community 2.0

John Hagel has a very thoughtful and lengthy essay on the challenges in building virtual communities.  Link:  Edge Persepctives.  --Dennis

Thursday, 29 March 2007

PBCore for publishing, sharing, and preservation

I'd like to shine some light on a good conversation that has popped up about the uses and importance of syndication and metadata on an Integrated Media Association blog.  See Jack Brighton's post with this title, John Proffitt's post, RSS a good start, but a federated PBCore-based metadata archive would be better, Dale Hobson's post, Thoughts from back home: centralize output, not input, and also the comments on those posts.  --Dennis

Tuesday, 27 March 2007

Thinking Strategically about Search

Umair Haque writes:

... Think about this intuitively: the more crap there is, the more stuff you have to wade through - the happier Google is (at least in the short run).  ¶  Let me put this even more succinctly. Google doesn't care about absolute levels of quality - it only cares about relative levels of quality. And the more media it indexes, the stronger this dilution of incentives gets.  ¶  Hence, it's incentive to "support" content creators, already weak, is going to diminish over time. ...

Link:  Bubblegeneration Strategy Lab.

Tuesday, 20 March 2007

Links, the Currency of The Machine

Terry Heaton writes:

Here is the latest in the on-going series of essays, TV News in a Postmodern World. This one deals with something we all take for granted about the web — links and linking. These, I believe, are the real currency of the web and that one day, like cash, we’ll find a way to buy and sell goods and services using them. Who’ll calculate the value? “The Machine,” of which Kevin Kelly so brilliantly wrote in his 2005 Wired essay, “We Are The Web.”  ¶  Links play a key role in the web’s determination of the new metric “influence,” and this will grow in terms of validity and value as the years go by. Those of us in traditional media embrace the concept of inbound links, because we can easily see how they help “drive traffic” or distribute our content. We’re reluctant to play with outbound links, however, and this is to our detriment. ...

Link:  Terry Heaton's PoMo Blog.

Saturday, 24 February 2007

Millions of Videos, and Now a Way to Search Inside Them

Jason Pontin has an interesting article on the voice recognition search technology used by Blinkx to search inside videos.  Link:  New York Times.  --Dennis

Saturday, 10 February 2007

Pew survey says Americans are tagging

Self-organization on the Web is happening, according to a 12/2006 study by Pew.  28% of online Americans have used tagging to organize their content and connect to similar content tagged by others.  Link:  Pew Internet and American Life Project [PDF].

Thursday, 25 January 2007

Another Addition to "What Is Attention?"

Michael H. Goldhaber writes:

I have added yet one more draft installment to Chapter 3 of my book, covering many topics having to do with how attention works, how we pay attention to things that emanate from more than one mind or from no mind, along with how attention relates to reputation, recognition, information, time, style, etc.

Thursday, 21 December 2006

Folksonomy vs. taxonomy

Now there's a title that will scare away most readers.

I've been thinking about folksonomies this week, occasioned by a post that Bruno Giussani made in his LunchOverIP blog about Gartner's "Hype Cycle" report.  That report ranked folksonomies as a low benefit emerging technology. 

He didn't report on why and I can't access the report, but that then was followed by a conversation this afternoon at work (broadcasting organization) with a smart colleague who was arguing that the tagging (i.e., folksonomy) and search capability built into the media-over-IP service we've begun to use in the past year will not be sufficient to enable users to find our content and that of our partners without  implementation of metadata on our part and some form of professionally-designed taxonomy.  Not just how do they find stuff, but also, how do they find our stuff?

Then, on the other hand, this evening I ran across a post, Folksonomy as Symbol, on the Berkman blog by Becca Tabasky that quotes a short pro-folksonomy essay (he calls them "bottoms-up taxonomies") by David Weinberger.  He writes:

... If a folksonomy is a symbol, what is it a symbol of?  ¶  First, folksonomies stick it to The Man... ¶  We don't need no stinkin' experts to organize ideas and information! There is, of course, inefficiency built into expert-based taxonomies because they have to choose one way of ordering, and that one way is necessarily infested with personal, class, and cultural biases. As Clay Shirky says, "Metadata is worldview." But beyond the inefficiency, simply having someone else have the authority to say 'It shall be filed thus' is a statement of political authority. Even when the experts do a good job—as they usually do, because they're experts—it is still an implicit statement that someone else's way of thinking is better than yours. ...

... Folksonomies also embrace excess. Publishing and broadcasting by their nature require us to trim the fat from our world. That's how those systems survive ...

Excellent essay, but then my colleague also made some excellent points.  For us professional media types, nothing defines the divide between the way we've done business throughout my 37-year career (and before) and the way we'll likely be doing business for the next 37 years than the admissibility of user organization of media content.  This is core value territory.  Are we to have self-organization for "small craft" content (videos of cats swinging on fans) and expert organization for "big craft" content?  Or is there a role for both?

We broadcasters need to understand that all curation doesn't need to be done on the ground on a program-by-program basis.  That's an artifact of the scarcity paradigm that's constrained us for decades.  Like most humans, we're good at making necessities into virtues.  But in an abundance paradigm, not only doesn't curation need to happen at the program level, it doesn't need to be done only by us.  It's not hard to envision curation at the five- or ten thousand-foot level -- which might be a working definition of taxonomies -- existing simultaneously and productively with user curation at ground level.

--Dennis

Wednesday, 20 December 2006

The Economics of Attention

John Hagel has a useful overview of writings about the attention economy in his Edge Perspectives weblog.  I've posted a number of links to the work in this area of Michael Goldhaber.  To access these, type Goldhaber in the Rollyo Search Weblog box on the lower right side of this page, then select Technology360.com as the search target.  --Dennis

Sunday, 26 November 2006

Gifting technologies: A BitTorrent case study

File-sharing services have an interesting problem balancing contributors and users.  Matei Ripeanu, Miranda Mowbray, Nazareno Andrade, and Aliandro Lima have done some interesting analysis of this in the case of BitTorrent.  The abstract:

This paper is concerned with gifting: giving not motivated by a direct, immediate, or obvious benefit. We analyze a popular technology used for gifting: the BitTorrent file–sharing system. We determine features associated with high levels of gifting and suggest changes to the protocol and to the design of associated BitTorrent Web sites to promote it. We then extend our conclusions and suggestions to gifting technologies in general. ...

Link:  First Monday.  --Dennis

Tuesday, 21 November 2006

Please DO Feed the Sites

ptvGuy has posted two of what he plans to be a four-part tutorial on RSS and why you should add feeds to your web site.  Links: Part 1 - Introduction.  Part 2 - What Is RSS?.  Very nice work.  Increasingly, a station web site without RSS feeds is like a station without a transmitter -- something like asking people to come by the studio to watch or listen to programs.  --Dennis

Sunday, 12 November 2006

Media Frenzy: A Struggle Over Dominance and Definition

Richard Siklos writes:

... In the last few weeks an enormous swarm of activity has been coming out of the Googleplex beehive in Mountain View, Calif. — much of it aimed squarely at preparing the search company to move its phenomenally lucrative advertising business beyond Web pages and into video, newspapers and radio. ...

... What’s at stake is pretty much everything in the $400 billion global advertising honey pot. Google’s efficiency at putting text ads next to search results is what sets it apart from Yahoo, MSN and the other big boys online. ...

Link:  New York Times.

Wednesday, 08 November 2006

The Tail That Wags the Web

P. J. Fusco writes:

... Long tail theory suggests an expansive search referral treasure trove exists in the unbranded keyword universe. But a strong brand should do more than just yield brand-centric search traffic. It should be put into a position to win non-branded search referrals too. To do so, more than 30 pages of the site need to be optimized.  ¶  Although branded search queries drive the search head, the number of pages that receive search referral benefits is usually a diminutive figure. Studies show as few as 5 percent of a site's pages successfully yield traffic for branded search queries.  ¶  Conversely, this leaves 95 percent of an online merchant's site that can be leveraged to target search tail keywords and keyword phrases. The primary concern is to get those pages yielding traffic. The non-yielding hordes of pages must be optimized, because merely being crawled and indexed aren't enough. ...

Link:  ClickZ Network.

Tuesday, 24 October 2006

Openness Matters. RSS Can Help