MEGA: Multi-View Semi-Supervised Clustering of Hypergraphs
The Program
We develop a semi-supervised multi-view clustering algorithm that can effectively handle hypergraphs. We simultaneously consider multi-view clustering and semi-supervised learning to not only reflect multiple views of data (i.e., multiple attributes or content of the objects as well as auxiliary relationships among the objects besides the connection structure of a hypergraph) but also improve the clustering accuracy by utilizing partially available labels of objects. In our method, the hypergraph structure is considered as one of the views of given objects. Our method can be considered to be a general framework for semi-supervised multiview clustering in that the input data is not necessarily restricted to a hypergraph. Also, our method is flexible in the sense that it can be used as either an unsupervised learning method or a semi-supervised method. That is, when there is no prior knowledge about the labels, our method works as an unsupervised multi-view clustering method. We name our method MEGA by abbreviating Multi-view sEmi-supervised hyperGrAph clustering.
Download
The code is released under the GNU Public License (GPL). Please acknowledge the use of the code/data with a citation.
The code is written in MATLAB R2018b.
To download the code and the datasets, please click here.
-data
Five datasets are used in our paper: query, gene, cora, dblp5, and dblp10.
-mega
To run MEGA, please read/run the MATLAB scripts. For example, to run MEGA on the query dataset, type run_query in MATLAB.
-result
The results are saved in the 'result' folder. For example, when 'run_query.m' is executed, the result is saved in result/MEGA_query_H.mat file.
Please acknowledge the use of the code/data with a citation.
MEGA: Multi-View Semi-Supervised Clustering of Hypergraphs, J. J. Whang, R. Du, S. Jung, G. Lee, B. Drake, Q. Liu, S. Kang, and H. Park, Proceedings of the VLDB Endowments (PVLDB), 2020. [pdf]
@ARTICLE{whang-pvldb2020,
author={J. J. Whang and R. Du and S. Jung and G. Lee and B. Drake and Q. Liu and S. Kang and H. Park},
journal={Proceedings of the VLDB Endowment},
title={{MEGA}: Multi-View Semi-Supervised Clustering of Hypergraphs},
year={2020},
volume={13},
number={5},
pages={698--711}
}
Bug reports and comments are
always appreciated.