Email:   Password:     Remember me
Create blog

Zigmas Bigelis' Blog

Blog about creativity, self-improvement, Web research, social systems, semantic systems, Web 2.0, 3.0, 4.0 etc

Posts tagged with "semantic web"

The Yahoo! Search Open Ecosystem

 Read the Yahoo blog about Yahoo and Semantic Web
http://www.ysearchblog.com/archives/000527.html

A few weeks ago, we began talking about the new Yahoo! Search open platform. Today, we're releasing more details about two important components of the initiative -- the developer platform as well as our support of a number of semantic web standards.

The Data Web in Action
While there has been remarkable progress made toward understanding the semantics of web content, the benefits of a data web have not reached the mainstream consumer. Without a killer semantic web app for consumers, site owners have been reluctant to support standards like RDF, or even microformats. We believe that app can be web search.

By supporting semantic web standards, Yahoo! Search and site owners can bring a far richer and more useful search experience to consumers. For example, by marking up its profile pages with microformats, LinkedIn can allow Yahoo! Search and others to understand the semantic content and the relationships of the many components of its site. With a richer understanding of LinkedIn's structured data included in our index, we will be able to present users with more compelling and useful search results for their site. The benefit to LinkedIn is, of course, increased traffic quality and quantity from sites like Yahoo! Search that utilize its structured data.

linkedin_FINAL.JPG

In the coming weeks, we'll be releasing more detailed specifications that will describe our support of semantic web standards. Initially, we plan to support a number of microformats, including hCard, hCalendar, hReview, hAtom, and XFN. Yahoo! Search will work with the web community to evolve the vocabulary framework for embedding structured data. For starters, we plan to support vocabulary components from Dublin Core, Creative Commons, FOAF, GeoRSS, MediaRSS, and others based on feedback. And, we will support RDFa and eRDF markup to embed these into existing HTML pages. Finally, we are announcing support for the OpenSearch specification, with extensions for structured queries to deep web data sources.

We believe that our open approach will let each of these formats evolve within their own passionate communities, while providing the necessary incentive to site owners (increased traffic from search) for more widespread adoption. Site owners interested in learning more about the open search platform can sign up here.

A Developer Ecosystem for Search
We're also announcing, today, that the Yahoo! Search open platform will be open to all third party developers. We will be kicking off this component of our open platform with a developer launch party at our Sunnyvale campus in the coming weeks. That day, we'll launch a beta program for a tool that developers can use to build Enhanced Results applications for the Yahoo! Search platform. Enhanced Results apps built by developers can utilize the structured data available through public APIs and in our index (made available by site owners through either feeds or the semantic web standards discussed above).

Let us know what you think below and keep an eye on the Search Blog -- we'll be posting more info about the upcoming launch party.


Amit Kumar
Director, Product Management, Yahoo! Search


Тим Бернерс-Ли: Google придется потесниться

http://svobodanews.ru/Article/2008/03/13/20080313160454010.html

Создатель Would Wide Web Тим Бернерс-Ли (Tim Berners-Lee) заявил TimesOnline, что Google будет вытеснен новой структурой Всемирной паутины так называемым Semantic Web (семантическая паутина). Над этим проектом Тим Бернерс-Ли работает уже много лет. Он считает, что Google хорошо работает только с текстовыми интернет-страницами, но сегодня интернет представляет собой гораздо более разнообразную информацию и для ее объединения методов Google уже недостаточно.

Тим Бернерс-Ли возглавляет Консорциум WWW (The World Wide Web Consortium – W3C), который основан MIT (Massachusetts Institute of Technology and the University of Southampton). Здесь уже много лет продолжаются исследования Semantic Web.

Как считает Бернерс-Ли, Semantic Web позволит сводить воедино любые, казалось бы, несопоставимые источники информации – будь то банковские счета, фотографии или звуковые файлы.

В основе концепции лежит работа с метаданными, которые характеризуют свойства и содержание интернет-ресурсов. Google использует чисто текстовый анализ документов. Используя «семантическую паутину», которая будет способна распознавать интересующую вас информацию, заключенную в совершенно разных формах интернет-данных, можно будет сконструировать веб-приложения, которые будут намного мощнее тех, что используются сейчас в Интернете. Серьезное преимущество Google заключается в том, что поисковая система не требует от пользователя никаких дополнительных описаний ресурса, а работает с теми данными которые размещены на ресурсе.

Тим Бернерс-Ли говорит, что семантическая паутина не будет подменять собой Всемирную паутину, а только станет ее надстройкой, переводя все содержимое на язык, более понятный компьютерам. В повседневной жизни, говорит он, это будет выглядеть примерно так: «Возьмем пример из сферы семейного бюджета. Обычно, анализируя банковские счета, мы мучительно вспоминаем обстоятельства своих денежных трат, особенно при заполнении налоговой декларации. Создаваемая нами семантическая паутина позволит свести воедино полностью несопоставимые вещи, такие как банковские счета и календарь, заставит их использовать один и тот же язык и делиться информацией друг с другом. Приставив на экране с помощью мышки счет к календарю, на последнем вы увидите заштрихованные участки, означающие дни ваших денежных расходов, - и сразу вспомните обстоятельства, в которых выписывали тот или иной чек».

По словам ученого, невероятно популярные сегодня социальные вебсайты, такие как MySpace или FaceBook (аналогом которых является российская социальная сеть «Одноклассники»), в конце концов будут вытеснены сетями, соединяющими не только людей, но и самые разнообразные объекты, размещенные в Сети.

Главная проблема создания семантической паутины, по мнению экспертов, состоит в том, чтобы найти универсальный способ распознания любого документа, помещаемого в Интернет, и фиксации ссылок на него. В случае удачи это сулит революционные преобразования во многих отраслях.

 

Wikipedia 3.0: The End of Google?

 

The Semantic Web (or Web 3.0) promises to “organize the world’s information” in a dramatically more logical way than Google can ever achieve with their current engine design. This is specially true from the point of view of machine comprehension as opposed to human comprehension.The Semantic Web requires the use of a declarative ontological language like OWL to produce domain-specific ontologies that machines can use to reason about information and make new conclusions, not simply match keywords.

However, the Semantic Web, which is still in a development phase where researchers are trying to define the best and most usable design models, would require the participation of thousands of knowledgeable people over time to produce those domain-specific ontologies necessary for its functioning.

Machines (or machine-based reasoning, aka AI software or ‘info agents’) would then be able to use those laboriously –but not entirely manually– constructed ontologies to build a view (or formal model) of how the individual terms within the information relate to each other. Those relationships can be thought of as the axioms (basic assumptions), which together with the rules governing the inference process both enable as well as constrain the interpretation (and well-formed use) of those terms by the info agents to reason new conclusions based on existing information, i.e. to think. In other words, theorems (formal deductive propositions that are provable based on the axioms and the rules of inference) may be generated by the software, thus allowing formal deductive reasoning at the machine level. And given that an ontology, as described here, is a statement of Logic Theory, two or more independent info agents processing the same domain-specific ontology will be able to collaborate and deduce an answer to a query, without being driven by the same software.

Thus, and as stated, in the Semantic Web individual machine-based agents (or a collaborating group of agents) will be able to understand and use information by translating concepts and deducing new information rather than just matching keywords.

Once machines can understand and use information, using a standard ontology language, the world will never be the same. It will be possible to have an info agent (or many info agents) among your virtual AI-enhanced workforce each having access to different domain specific comprehension space and all communicating with each other to build a collective consciousness.

You’ll be able to ask your info agent or agents to find you the nearest restaurant that serves Italian cuisine, even if the restaurant nearest you advertises itself as a Pizza joint as opposed to an Italian restaurant. But that is just a very simple example of the deductive reasoning machines will be able to perform on information they have.

Far more awesome implications can be seen when you consider that every area of human knowledge will be automatically within the comprehension space of your info agents. That is because each info agent can communicate with other info agents who are specialized in different domains of knowledge to produce a collective consciousness (using the Borg metaphor) that encompasses all human knowledge. The collective “mind” of those agents-as-the-Borg will be the Ultimate Answer Machine, easily displacing Google from this position, which it does not truly fulfill.

The problem with the Semantic Web, besides that researchers are still debating which design and implementation of the ontology language model (and associated technologies) is the best and most usable, is that it would take thousands or tens of thousands of knowledgeable people many years to boil down human knowledge to domain specific ontologies.

However, if we were at some point to take the Wikipedia community and give them the right tools and standards to work with (whether existing or to be developed in the future), which would make it possible for reasonably skilled individuals to help reduce human knowledge to domain-specific ontologies, then that time can be shortened to just a few years, and possibly to as little as two years.

The emergence of a Wikipedia 3.0 (as in Web 3.0, aka Semantic Web) that is built on the Semantic Web model will herald the end of Google as the Ultimate Answer Machine. It will be replaced with “WikiMind” which will not be a mere search engine like Google is but a true Global Brain: a powerful pan-domain inference engine, with a vast set of ontologies (a la Wikipedia 3.0) covering all domains of human knowledge, that can reason and deduce answers instead of just throwing raw information at you using the outdated concept of a search engine.

Notes

After writing the original post I found out that the Wikipedia application, also known as MediaWiki and not to be confused with Wikipedia.org, has already been used to implement ontologies. The name that they’ve chosen is Ontoworld. I think WikiMind or WikiBorg would have been a cooler name, but I like ontoworld, too, as in “and it descended onto the world,” since that may be a reference to the global mind a Semantic-Web-enabled OntoWorld would lead to.

READ MORE

What's Next on the Web

 I recommend to see presentation on the Web:

What's Next on the Web
Web Technology Trends for 2008 and Beyond