Nonparametric Statistical Structuring of Knowledge Systems Using Binary Feature Matches

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedings

Structuring knowledge systems with binary features is often based on imposing a similarity measure and clustering objects according to this similarity. Unfortunately, such analyses can be heavily influenced by the choice of similarity measure. Furthermore, it is unclear at which level clusters have statistical support and how this approach generalizes to the structuring and alignment of knowledge systems. We propose a non-parametric Bayesian generative model for structuring binary feature data that does not depend on a specific choice of similarity measure. We jointly model all combinations of binary matches and structure the data into groups at the level in which they have statistical support. The model naturally extends to structuring and aligning an arbitrary number of systems. We analyze three datasets on educational concepts and their features and demonstrate how the proposed model can both be used to structure each system separately or to jointly align two or more systems. The proposed method forms a promising new framework for the statistical modeling and alignment of structure across an arbitrary number of systems.

Publication information

Original languageEnglish
Title of host publicationProceedings of MLSP 2014
EditorsMamadou Mboup, Tülay Adali, Éric Moreau, Jan Larsen
Number of pages6
Place of PublicationNew York
Publication date2014
ISBN (Print)9781479936946
StatePublished - 2014
EventThe 24th IEEE International Workshop on Machine Learning for Signal Processing. MLSP 2014 - Reims Centre des Congrès, Reims, France
Duration: 21 Sep 201424 Sep 2014
Conference number: 24


ConferenceThe 24th IEEE International Workshop on Machine Learning for Signal Processing. MLSP 2014
LocationReims Centre des Congrès

Bibliographical note

CBS Library does not have access to the material

    Research areas

  • Bayesian non-parametrics, Relational modeling, Binary similarity, Knowledge structuring

ID: 42559987