represent or “provide a summary” of the clusters they stand for.The redundancy-aware top-k patterns make a trade-off between significance and redundancy. The three patterns chosen here to have high significance and low redundancy. It also Propose the MMS (Maximal MarginalSignificance) for measuring the combined significance of a pattern set Reference:[Jiawei Han. (2011) Data mining : concepts and techniques (The Morgan Kaufmann Series in Data Management Systems).waltham, Massachusetts: Morgan Kaufmann.]4
Srihari Kosanam7.12 It is interesting to generate semantic annotations for mined patterns. Section 7.6.1 presented a pattern annotation method. Alternative methods are possible, suchas by utilizing type information. In the DBLP data set, for example, authors, conferences, terms, and papers form multi-typed data. Develop a method for automated semantic pattern annotation that makes good use of typed information.Solution:A tool named Ontea can be used to generate automated semantic annotations.The method for ontea is as follows:Ontea identifies objects, their properties or their position in the text by applying patterns on a text. The input of the method is text and patterns and the output is key-value pairs which can be transformed into RDF/OWL individuals.It searches for relevant instances in local knowledge base based on common patterns.Ontea creates new instances of the objects found in text. Ontea creation IR with the use of information retrieval techniques, e.g. Lucene or RFTS, is used to identify a relevance value of created instance.Ontea works over text applicable to an application problem domain that is described by a domain ontological model and uses regular expressions to identify relations between text and a semantic model.Its architecture facilitates customizable and extensible transformation chain of key-value pairs extracted from text.It is also able to assign discovered or created object properties to individual which represents processed text or its part.Thus, Ontea platform can generate effective automated semantic pattern annotation that makes good use of typed information.Reference:[Laclavik, Michal & Hluchý, Ladislav & Šeleng, Martin & Ciglan, Marek. (2009). Ontea: Platform for Pattern Based Automated Semantic Annotation.. Computing and Informatics. (28. 555-579).]5
You've reached the end of your free preview.
Want to read all 5 pages?
Data Mining, representative, Association rule learning, Category of sets