SCN: Semantic web

Steve steve at advocate.net
Fri Feb 22 09:10:52 PST 2002


x-no-archive: yes

===================


(Otis Port, BusinessWeek, excerpts)---Tim Berners-Lee...wanted to make 
the world a richer place, not amass personal wealth. So he gave his 
brainchild to us all.  

Now, the idealistic father of the Web plans an even grander gift: a next-
generation Web that almost certainly will rank as the most important 
software of this decade. Berners-Lee regards today's Web as a rebellious 
adolescent that can never fulfill his original expectations. By 2005, he 
hopes to begin replacing it with the Semantic Web--a smart network that 
will finally understand human languages and make computers virtually as 
easy to work with as other humans.  

This new project is a collaborative effort of hundreds of minds, with 
Berners-Lee as maestro. The ultimate goal: to turn the Web into a gigantic 
brain. 

Every computer connected to the Internet would have access to all the 
knowledge that humankind has accumulated in science, business, and the 
arts since we began painting the walls of caves 30,000 years ago. This 
racial memory would be a constant source of inspiration for dreaming 
sublime dreams, boosting human creativity, and solving previously 
intractable problems. 

Online commerce chores and Web services would be handled by software 
modules that snap together like toy Lego blocks. "We expect the Semantic 
Web to be as big a revolution as the original Web itself," says Richard 
Hayes-Roth, Hewlett-Packard's chief technology officer for software.  

To get there, though, Berners-Lee must navigate some very muddy 
waters. Development of the Semantic Web is being funded mainly by the 
World Wide Web Consortium (W3C), which he heads. Founded in 1994 
and based at Massachusetts Institute of Technology, the W3C is the 
guardian of Web technology and standards. Its budget relies heavily on 
membership dues from more than 400 companies. 

And while making money may not be a primary motivator for Berners-Lee, 
it's what business is all about. Conflicts, in short, were inevitable--and not 
just centering around Berners-Lee. Indeed, mediating the inevitable 
clashes among W3C's hundreds of companies, each with its own agenda, 
will be the acid test of Berners-Lee's leadership.  

A particularly thorny issue cropped up last August. A W3C committee of 13 
members, including IBM and Microsoft, proposed installing tollbooths on 
the Information Highway by allowing patented software to be included in 
W3C-approved standards. The committee reasoned that as online 
offerings grow more sophisticated, the developers of software for handling 
advanced Web services, such as supply-chain management and 
collaborative engineering, should be permitted to collect royalties on their 
investments. 

But Berners-Lee is philosophically opposed to standards that would 
impose fees, and many other W3C members, such as the Free Software 
Foundation and the Open Source Initiative, also denounced the notion. 
"Things have calmed down a bit," says Robert S. Sutor, IBM's director of e-
business standards, and the committee is now rethinking its stance. 
Berners-Lee says the mood has now shifted "strongly toward a royalty-
free position."  

Meanwhile, the W3C is taking heat on other fronts. Critics say the 
organization is moving too slowly on developing standards to ensure that 
different Web-service offerings can work together. 

Business sees major revenue growth from better tools that can deal with 
complicated travel arrangements, say, or deliver new entertainment 
options. But companies are reluctant to invest in developing such software 
until big corporations are on the same page. 

What good would it do, for example, to create a program under Microsoft's 
Web-services initiative, dubbed .Net, if it couldn't link up with a related 
program written in Java for Sun Microsystems' counterpart? Or if a 
computer-aided design program at Boeing 
Corp. were unable to talk to the company's engineering or 
manufacturing software?  

A W3C draft specification aimed at harmonizing Web services was 
published in January, 2001, "but the W3C then sat on its hands for 
a whole year" complains Uttam M. Narsu, an analyst at Giga 
Information Group. Not until late January did the W3C organize 
several working groups to tackle standards for Web services. 

"My sense is that W3C staffers are too visionary," Narsu says. 
"They're devoting too much effort to the Semantic Web, believing it 
will change the world yet again, and not enough effort to less sexy 
things that are important to business in the near term."  

The Semantic Web is certainly sexy. As envisioned by Berners-
Lee, it would understand not only the meaning of words and 
concepts but also the logical relationships among them. That has 
awesome potential. 

Most knowledge is built on two pillars: semantics and mathematics. 
In number-crunching, computers already outclass people. 
Machines that are equally adroit at dealing with language and 
reason won't just help people uncover new insights; they could 
blaze new trails on their own.  

Even with a fairly crude version of this future Web, mining online 
repositories for nuggets of knowledge would no longer force 
people to wade through screen after screen of extraneous data. 
Instead, computers would dispatch intelligent agents, or software 
messengers, to explore Web sites by the thousands and logically 
sift out just what's relevant. 

That alone would provide a major boost in productivity at work and 
at home. But there's far more.  

Software agents could also take on many routine business chores, 
such as helping manufacturers find and negotiate with lowest-cost 
parts suppliers and handling help-desk questions. The Semantic 
Web would also be a bottomless trove of eureka insights. 

Most inventions and scientific breakthroughs, including today's 
Web, spring from novel combinations of existing knowledge. The 
Semantic Web would make it possible to evaluate more 
combinations overnight than a person could juggle in a lifetime. 

"A lot of scientific research is now interdisciplinary, like global 
climate change, and the scientists need to talk to each other," says 
Chaitanya Baru, a data-mining expert at the San Diego 
Supercomputer Center. "But they use different jargon."  

Sure, scientists and other people can post ideas on the Web today 
for others to read. But with machines doing the reading and 
translating jargon terms, related ideas from millions of Web pages 
could be distilled and summarized. That will lift the ability to assess 
and integrate information to new heights.  

As a result, Berners-Lee envisions a new age of enlightenment. 
The Semantic Web, he predicts, "will help more people become 
more intuitive as well as more analytical. It will foster global 
collaborations among people with diverse cultural perspectives, so 
we have a better chance of finding the right solutions to the really 
big issues--like the environment and climate warming." In short, it 
will change the world even more than his original creation.  

The capital-Q question is: Can he pull it off? There's no shortage 
of doubters. Still, most people who know the reclusive Berners-Lee 
are optimistic. "Tim has a gift for seeing the future and making it 
happen," says John R. Patrick, a retired IBM senior exec who 
helped found the W3C. 

Eric E. Schmidt, formerly of Sun and now chairman of search-
engine innovator Google Inc., says Berners-Lee would be a shoo-
in for a Nobel prize--if Nobels were given in computer science. And 
Larry L. Smarr, director of the California Institute of 
Telecommunications & Information Technology at the University of 
California at San Diego, predicts the Semantic Web will cast 
Berners-Lee as "an historic-level figure."  

What impresses those elder statesmen of computing is Berners-
Lee's leadership track record. For a somewhat shy software nerd, 
he has demonstrated a surprising flair for diplomacy, combined 
with bulldog tenacity. 

In the midst of the dot-com bust two years ago, Berners-Lee 
persuaded the W3C's hard-nosed denizens of commerce to begin 
developing the Semantic Web. And before that, in 1998, he 
persuaded them to approve extensible markup language (XML), an 
important new Web lingo. "Tim did a great job shepherding XML 
through the W3C," notes Smarr.  

Indeed, the evolution of XML may be a useful foretaste of what's in 
store for the Berners-Lee's new vision. In the late 1990s, this 
language was constructed to help computers identify different 
types of data on the Web. "When we started work on XML, it was 
considered pretty esoteric," recalls Sutor of IBM. "But now it's the 
underpinnings of everything we're doing in e-business." 

Ditto for hundreds of others, including the 300 companies already 
using XML software from Open Applications Group Inc. OAGI 
predicts that number will double this year.  

Berners-Lee worked tirelessly to win support for XML because it's 
a quantum leap beyond today's witless hypertext markup language 
(HTML)--and it's the cornerstone of the Semantic Web. 

HTML is the language that Berners-Lee concocted while on a 
fellowship as a database engineer at the European Organization 
for Nuclear Research (CERN) in Geneva. But the language merely 
specifies the appearance of a Web page: what colors go where, 
which type sizes to use, and where to put graphic elements. 

To a Web browser, or most other computer programs, these words 
and numbers are just squiggles of gibberish. Without some kind of 
clue, computers parsing a Web page can't determine if "buy" is a 
noun or a verb, or whether "20031" is a Zip Code, a price, or the 
number of orders placed last month.  

In contrast, XML tags imbue the Web with meaning. Examples 
might be labels for medical records. The "name" tag would have 
links to relevant sections of online literature, also coded with XML, 
and "interaction" would point to other drugs that interfere with the 
medication. Then, when a doctor bats out a prescription on a 
computer, a software agent could verify that the drug is 
appropriate for the diagnosis, check the patient's records to see 
what other medicines the person is taking, and determine whether 
any of them is likely to interfere with the new prescription. A group 
of university and industrial researchers is already working on such 
a scheme with the Veterans Administration and the National 
Library of Medicine.  

Today, Berners-Lee presides over a research octopus whose 
tentacles extend to all five continents. The 60 staffers at W3C 
headquarters coordinate the efforts of hundreds of researchers at 
50 university and government laboratories that are W3C members, 
plus two-score additional universities around the world. 

For now, most of the actual work on the Semantic Web is being 
done by academics because, Berners-Lee quips, "only a few 
industry people have been given a little leeway to go off and 
explore my crazy ideas."  

XML is a start--but only the tip of the iceberg. XML tags are 
essentially just labels that point to a definition in a combination 
dictionary and thesaurus. That's how a software agent can 
determine that two different tags actually mean the same thing. 
When an agent needs further details, there's an online 
encyclopedia, called an ontology. It lays out the logical rules and 
relationships among XML terms.  

Merging these elements is where semantics gets sticky. Because 
we humans assimilate language gradually, we end up unaware of 
how complicated things are--until we try to construct a new digital 
grammar from scratch, with numerous dialects for various 
industries. 

Devising software that can comprehend words, concepts, and 
relationships has long been a major hangup in artificial intelligence 
(AI) research. Adding a pervasive layer of standardization will test 
the limits of human ingenuity--and patience.  

In the fast-paced Internet Age, the time needed to build consensus 
on the smallest of these details could be the Semantic Web's chief 
obstacle, says MCI's Cerf. He worries that standards could "fall 
victim to business maneuvering" by the W3C's corporate members. 
The result might end up similar to today's systems for electronic 
data interchange (EDI)--with a lot of proprietary systems, each with 
its own lingo. 

On the other hand, partly because the industry is acutely aware of 
EDI's problems and limitations, executives are optimistic. "It'll be a 
chicken-or-egg situation until a killer app comes along--but I'm very 
confident that that will happen," says W. Daniel Hillis, a 
supercomputer pioneer who now heads startup Applied Minds Inc.  

Some academics are enthusiastic about the corporate involvement 
that Berners-Lee has attracted. James A. Hendler, a computer 
scientist at the University of Maryland, says he has worked on AI 
for 20 years and "it has been almost impossible to get the attention 
of business." But now, he says, "the advances we made in the 
1990s are being readied for actual use with the Semantic Web, out 
there in the real world."  

One other factor could give Berners-Lee's vision an enormous 
boost: The Pentagon's Defense Advanced Research Projects 
Agency (DARPA) is pushing it. This is the outfit that created the 
guts of the Internet three decades ago. In 1998, it launched the 
DARPA Agent MarkUp Language (DAML) program--initially 
managed by Hendler, who took a leave of absence from Maryland. 
DARPA is now a W3C member, and DAML is being developed in 
concert with XML.  

DARPA wants to develop agent-based systems for command-and-
control jobs in joint military operations, whether they be 
multiservice or multinational. For example, an international team of 
16 organizations--led by a spin-off of Britain's Defense Ministry 
called QinetiQ Ltd.--is working on a "coalition of agents" project. 
With DAML tags pointing to online databases, plus access to 
satellite reconnaissance images, the agents would be aware of the 
capabilities and locations of the many different weapons and 
logistics systems deployed to such spots as Afghanistan. So they 
could provide commanders with instant advice for coping with 
shifting conditions.  

DARPA is also funding research at MIT, headed by Berners-Lee 
but separate from the W3C, aimed at creating new AI tools for 
tomorrow's Web. One result would be Semantic Web logic 
language (Swell). Another goal is to marry the Semantic Web with 
MIT's Oxygen project, which aims to make various digital systems 
as easy to use as breathing, thanks to advanced machine-learning 
tricks and new AI software. 

Cailliau, Berners-Lee's former boss at CERN, figures the Web's inventor 
relishes this research. "I think Tim does not really like the role" of leading a 
big outfit like the W3C, says Cailliau. "He is more comfortable with a small 
team and joining in the fun of writing actual code."  

Berners-Lee admits that building consensus among the W3C's members 
can be trying at times. But someone needs to keep development of the 
Semantic Web on course toward enriching the world--and nobody is better 
qualified than Tim.  


Copyright 2002 The McGraw-Hill Companies Inc.






* * * * * * * * * * * * * *  From the Listowner  * * * * * * * * * * * *
.	To unsubscribe from this list, send a message to:
majordomo at scn.org		In the body of the message, type:
unsubscribe scn
==== Messages posted on this list are also available on the web at: ====
* * * * * * *     http://www.scn.org/volunteers/scn-l/     * * * * * * *



More information about the scn mailing list