Title
Constrained Non-Affine Alignment of Embeddings
Abstract
Embeddings are one of the fundamental building blocks for data analysis tasks. Embeddings are already essential tools for large language models and image analysis, and their use is being extended to many other research domains. The generation of these distributed representations is often a data-and computation-expensive process; yet the holistic analysis and adjustment of them after they have been created is still a developing area. In this paper, we first propose a very general quantitatively measure for the presence of features in the embedding data based on if it can be learned. We then devise a method to remove or alleviate undesired features in the embedding while retaining the essential structure of the data. We use a Domain Adversarial Network (DAN) to generate a non-affine transformation, but we add constraints to ensure the essential structure of the embedding is preserved. Our empirical results demonstrate that the proposed algorithm significantly outperforms the state-of-art unsupervised algorithm on several data sets, including novel applications from the industry.
Year
DOI
Venue
2021
10.1109/ICDM51629.2021.00179
2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021)
Keywords
DocType
ISSN
embeddings, alignment
Conference
1550-4786
Citations 
PageRank 
References 
0
0.34
0
Authors
10
Name
Order
Citations
PageRank
Yuwei Wang120.70
Yan Zheng202.37
Yanqing Peng300.34
Chin-Chia Michael Yeh411.71
Zhongfang Zhuang500.68
Mahashweta Das671.75
Mangesh Bendre700.68
Feifei Li82242120.05
Wei Zhang9569.29
Jeff M. Phillips1053649.83