The redundancy aware top k patterns make a trade off between significance and

The redundancy aware top k patterns make a trade off

This preview shows page 4 - 5 out of 5 pages.

represent or “provide a summary” of the clusters they stand for. The redundancy-aware top-k patterns make a trade-off between significance and redundancy. The three patterns chosen here to have high significance and low redundancy. It also Propose the MMS (Maximal Marginal Significance) for measuring the combined significance of a pattern set Reference: [Jiawei Han. (2011) Data mining : concepts and techniques (The Morgan Kaufmann Series in Data Management Systems) . waltham , Massachusetts: Morgan Kaufmann.] 4
Image of page 4
Srihari Kosanam 7.12 It is interesting to generate semantic annotations for mined patterns. Section 7.6.1 presented a pattern annotation method. Alternative methods are possible, such as by utilizing type information. In the DBLP data set, for example, authors, conferences, terms, and papers form multi-typed data. Develop a method for automated semantic pattern annotation that makes good use of typed information. Solution: A tool named Ontea can be used to generate automated semantic annotations. The method for ontea is as follows: Ontea identifies objects, their properties or their position in the text by applying patterns on a text. The input of the method is text and patterns and the output is key-value pairs which can be transformed into RDF/OWL individuals. It searches for relevant instances in local knowledge base based on common patterns. Ontea creates new instances of the objects found in text. Ontea creation IR with the use of information retrieval techniques, e.g. Lucene or RFTS, is used to identify a relevance value of created instance. Ontea works over text applicable to an application problem domain that is described by a domain ontological model and uses regular expressions to identify relations between text and a semantic model. Its architecture facilitates customizable and extensible transformation chain of key-value pairs extracted from text. It is also able to assign discovered or created object properties to individual which represents processed text or its part. Thus, Ontea platform can generate effective automated semantic pattern annotation that makes good use of typed information. Reference: [Laclavik, Michal & Hluchý, Ladislav & Šeleng, Martin & Ciglan, Marek. (2009). Ontea: Platform for Pattern Based Automated Semantic Annotation.. Computing and Informatics. (28. 555-579).] 5
Image of page 5

You've reached the end of your free preview.

Want to read all 5 pages?

  • Fall '08
  • JIN,M
  • Data Mining, representative, Association rule learning, Category of sets

  • Left Quote Icon

    Student Picture

  • Left Quote Icon

    Student Picture

  • Left Quote Icon

    Student Picture

Stuck? We have tutors online 24/7 who can help you get unstuck.
A+ icon
Ask Expert Tutors You can ask You can ask You can ask (will expire )
Answers in as fast as 15 minutes