Posts tagged with "semantic web"
|
|
|
Read the Yahoo blog about Yahoo and Semantic Web http://www.ysearchblog.com/archives/000527.html
A few weeks ago, we began talking
about the new Yahoo! Search open platform. Today, we're releasing more
details about two important components of the initiative -- the
developer platform as well as our support of a number of semantic web standards.
The Data Web in Action
While there has been remarkable progress made toward understanding the
semantics of web content, the benefits of a data web have not reached
the mainstream consumer. Without a killer semantic web app for
consumers, site owners have been reluctant to support standards like RDF, or even microformats. We believe that app can be web search.
By supporting semantic web standards, Yahoo! Search and site owners
can bring a far richer and more useful search experience to consumers.
For example, by marking up its profile pages with microformats, LinkedIn
can allow Yahoo! Search and others to understand the semantic content
and the relationships of the many components of its site. With a richer
understanding of LinkedIn's structured data included in our index, we
will be able to present users with more compelling and useful search
results for their site. The benefit to LinkedIn is, of course,
increased traffic quality and quantity from sites like Yahoo! Search
that utilize its structured data.

In the coming weeks, we'll be releasing more detailed specifications
that will describe our support of semantic web standards. Initially, we
plan to support a number of microformats, including hCard, hCalendar, hReview, hAtom, and XFN.
Yahoo! Search will work with the web community to evolve the vocabulary
framework for embedding structured data. For starters, we plan to
support vocabulary components from Dublin Core, Creative Commons, FOAF, GeoRSS, MediaRSS, and others based on feedback. And, we will support RDFa and eRDF markup to embed these into existing HTML pages. Finally, we are announcing support for the OpenSearch specification, with extensions for structured queries to deep web data sources.
We believe that our open approach will let each of these formats
evolve within their own passionate communities, while providing the
necessary incentive to site owners (increased traffic from search) for
more widespread adoption. Site owners interested in learning more about
the open search platform can sign up here.
A Developer Ecosystem for Search
We're also announcing, today, that the Yahoo! Search open platform will
be open to all third party developers. We will be kicking off this
component of our open platform with a developer launch party at our
Sunnyvale campus in the coming weeks. That day, we'll launch a beta
program for a tool that developers can use to build Enhanced Results
applications for the Yahoo! Search platform. Enhanced Results apps
built by developers can utilize the structured data available through
public APIs and in our index (made available by site owners through
either feeds or the semantic web standards discussed above).
Let us know what you think below and keep an eye on the Search Blog
-- we'll be posting more info about the upcoming launch party.
Amit Kumar
Director, Product Management, Yahoo! Search
|
|
|
|
|
|
|
|
13.03.08 http://svobodanews.ru/Article/2008/03/13/20080313160454010.html
Создатель
Would Wide Web Тим Бернерс-Ли (Tim Berners-Lee) заявил TimesOnline, что
Google будет вытеснен новой структурой Всемирной паутины так называемым
Semantic Web (семантическая паутина). Над этим проектом Тим Бернерс-Ли
работает уже много лет. Он считает, что Google хорошо работает только с
текстовыми интернет-страницами, но сегодня интернет представляет собой
гораздо более разнообразную информацию и для ее объединения методов
Google уже недостаточно. Тим Бернерс-Ли возглавляет Консорциум WWW (The World Wide Web Consortium – W3C),
который основан MIT (Massachusetts Institute of Technology and the
University of Southampton). Здесь уже много лет продолжаются
исследования Semantic Web. Как считает Бернерс-Ли, Semantic Web
позволит сводить воедино любые, казалось бы, несопоставимые источники
информации – будь то банковские счета, фотографии или звуковые файлы. В
основе концепции лежит работа с метаданными, которые характеризуют
свойства и содержание интернет-ресурсов. Google использует чисто
текстовый анализ документов. Используя «семантическую паутину», которая
будет способна распознавать интересующую вас информацию, заключенную в
совершенно разных формах интернет-данных, можно будет сконструировать
веб-приложения, которые будут намного мощнее тех, что используются
сейчас в Интернете. Серьезное преимущество Google заключается в том,
что поисковая система не требует от пользователя никаких дополнительных
описаний ресурса, а работает с теми данными которые размещены на
ресурсе. Тим Бернерс-Ли говорит,
что семантическая паутина не будет подменять собой Всемирную паутину, а
только станет ее надстройкой, переводя все содержимое на язык, более
понятный компьютерам. В повседневной жизни, говорит он, это будет
выглядеть примерно так: «Возьмем пример из сферы семейного бюджета.
Обычно, анализируя банковские счета, мы мучительно вспоминаем
обстоятельства своих денежных трат, особенно при заполнении налоговой
декларации. Создаваемая нами семантическая паутина позволит свести
воедино полностью несопоставимые вещи, такие как банковские счета и
календарь, заставит их использовать один и тот же язык и делиться
информацией друг с другом. Приставив на экране с помощью мышки счет к
календарю, на последнем вы увидите заштрихованные участки, означающие
дни ваших денежных расходов, - и сразу вспомните обстоятельства, в
которых выписывали тот или иной чек». По словам ученого,
невероятно популярные сегодня социальные вебсайты, такие как MySpace
или FaceBook (аналогом которых является российская социальная сеть
«Одноклассники»), в конце концов будут вытеснены сетями, соединяющими
не только людей, но и самые разнообразные объекты, размещенные в Сети. Главная
проблема создания семантической паутины, по мнению экспертов, состоит в
том, чтобы найти универсальный способ распознания любого документа,
помещаемого в Интернет, и фиксации ссылок на него. В случае удачи это
сулит революционные преобразования во многих отраслях.
|
|
|
|
|
|
|
|
The Semantic Web (or Web 3.0) promises to “organize the world’s
information” in a dramatically more logical way than Google can ever
achieve with their current engine design. This is specially true from
the point of view of machine comprehension as opposed to human
comprehension.The Semantic Web requires the use of a declarative
ontological language like OWL to produce domain-specific ontologies
that machines can use to reason about information and make new
conclusions, not simply match keywords.
However, the Semantic Web, which is still in a development phase
where researchers are trying to define the best and most usable design
models, would require the participation of thousands of knowledgeable
people over time to produce those domain-specific ontologies necessary
for its functioning.
Machines (or machine-based reasoning, aka AI software or ‘info
agents’) would then be able to use those laboriously –but not entirely
manually– constructed ontologies to build a view (or formal model) of
how the individual terms within the information relate to each other.
Those relationships can be thought of as the axioms (basic
assumptions), which together with the rules governing the inference
process both enable as well as constrain the interpretation (and well-formed use)
of those terms by the info agents to reason new conclusions based on
existing information, i.e. to think. In other words, theorems (formal
deductive propositions that are provable based on the axioms and the
rules of inference) may be generated by the software, thus allowing
formal deductive reasoning at the machine level. And given that an
ontology, as described here, is a statement of Logic Theory, two or
more independent info agents processing the same domain-specific
ontology will be able to collaborate and deduce an answer to a query,
without being driven by the same software.
Thus, and as stated, in the Semantic Web individual machine-based
agents (or a collaborating group of agents) will be able to understand
and use information by translating concepts and deducing new
information rather than just matching keywords.
Once machines can understand and use information,
using a standard ontology language, the world will never be the same.
It will be possible to have an info agent (or many info agents) among
your virtual AI-enhanced workforce each having access to different
domain specific comprehension space and all communicating with each
other to build a collective consciousness.
You’ll be able to ask your info agent or agents to find you the
nearest restaurant that serves Italian cuisine, even if the restaurant
nearest you advertises itself as a Pizza joint as opposed to an Italian
restaurant. But that is just a very simple example of the deductive
reasoning machines will be able to perform on information they have.
Far more awesome implications can be seen when you consider that
every area of human knowledge will be automatically within the
comprehension space of your info agents. That is because each info
agent can communicate with other info agents who are specialized in
different domains of knowledge to produce a collective consciousness
(using the Borg metaphor) that encompasses all human knowledge. The
collective “mind” of those agents-as-the-Borg will be the Ultimate
Answer Machine, easily displacing Google from this position, which it
does not truly fulfill.
The problem with the Semantic Web, besides that researchers are
still debating which design and implementation of the ontology language
model (and associated technologies) is the best and most usable, is
that it would take thousands or tens of thousands of knowledgeable
people many years to boil down human knowledge to domain specific
ontologies.
However, if we were at some point to take the Wikipedia community and give them the right
tools and standards to work with (whether existing or to be developed
in the future), which would make it possible for reasonably skilled
individuals to help reduce human knowledge to domain-specific
ontologies, then that time can be shortened to just a few years, and
possibly to as little as two years.
The emergence of a Wikipedia 3.0 (as in Web 3.0, aka Semantic Web)
that is built on the Semantic Web model will herald the end of Google
as the Ultimate Answer Machine. It will be replaced with “WikiMind”
which will not be a mere search engine like Google is but a true Global
Brain: a powerful pan-domain inference engine, with a vast set of
ontologies (a la Wikipedia 3.0) covering all domains of human
knowledge, that can reason and deduce answers instead of just throwing
raw information at you using the outdated concept of a search engine.
Notes
After writing the original post I found out that the Wikipedia
application, also known as MediaWiki and not to be confused with
Wikipedia.org, has already been used to implement ontologies. The name that they’ve chosen is Ontoworld. I think WikiMind or WikiBorg would have been a cooler name, but I like ontoworld, too, as in “and it descended onto the world,” since that may be a reference to the global mind a Semantic-Web-enabled OntoWorld would lead to.
READ MORE |
|
|
|
|
|
|
|