TY - GEN
T1 - Highly efficient optimal K-anonymity for biomedical datasets
AU - Kohlmayer, Florian
AU - Prasser, Fabian
AU - Eckert, Claudia
AU - Kemper, Alfons
AU - Kuhn, Klaus A.
PY - 2012
Y1 - 2012
N2 - K-anonymization is a widespread technique for the de-identification of biomedical datasets. To avoid rendering the data useless for further analysis, it is often important to find an optimal solution to the k-anonymity problem, i.e., a transformation with minimum information loss. As performance is often a key requirement, this paper describes an efficient implementation of a k-anonymization algorithm that is especially suitable for biomedical datasets. Although our basic implementation already offers excellent performance, we present several further optimizations and show that these yield an additional speedup of up to a factor of five, even for large datasets.
AB - K-anonymization is a widespread technique for the de-identification of biomedical datasets. To avoid rendering the data useless for further analysis, it is often important to find an optimal solution to the k-anonymity problem, i.e., a transformation with minimum information loss. As performance is often a key requirement, this paper describes an efficient implementation of a k-anonymization algorithm that is especially suitable for biomedical datasets. Although our basic implementation already offers excellent performance, we present several further optimizations and show that these yield an additional speedup of up to a factor of five, even for large datasets.
UR - http://www.scopus.com/inward/record.url?scp=84867325726&partnerID=8YFLogxK
U2 - 10.1109/CBMS.2012.6266366
DO - 10.1109/CBMS.2012.6266366
M3 - Conference contribution
AN - SCOPUS:84867325726
SN - 9781467320511
T3 - Proceedings - IEEE Symposium on Computer-Based Medical Systems
BT - Proceedings of the 25th IEEE International Symposium on Computer-Based Medical Systems, CBMS 2012
T2 - 25th IEEE International Symposium on Computer-Based Medical Systems, CBMS 2012
Y2 - 20 June 2012 through 22 June 2012
ER -