Joel L. Wolf, Mark S. Squillante, et al.
IEEE Transactions on Knowledge and Data Engineering
String data is especially important in privacy-preserving data mining because most DNA and biological data is coded as strings. In this paper, we discuss a new method for privacy-preserving mining of string data that uses simple template-based condensation models. The template-based model turns out to be effective in practice and preserves important statistical characteristics of the strings.
Philip S. Yu, Xin Li, et al.
WWW Alt. 2004
Junyi Xie, Jun Yang, et al.
ICDE 2008
Douglas W. Cornell, Daniel M. Dias, et al.
IEEE Transactions on Software Engineering