Data Entry: Please note that the research database will be replaced by UNIverse by the end of October 2023. Please enter your data into the system https://universe-intern.unibas.ch. Thanks
Anonymization of System Logs for Preserving Privacy and Reducing Storage
Editor(s)
Arai, Kohei; Kapoor, Supriya; Bhatia, Rahul
Book title (Conference Proceedings)
Advances in Information and Communication Networks. Proceedings of the 2018 Future of Information and Communication Conference (FICC)
Volume
2
Place of Conference
Singapore
Year of Conference
2018
Publisher
Springer
Place of Publication
Cham
Pages
162-179
ISSN/ISBN
978-3-030-03404-7 ; 978-3-030-03405-4
Keywords
Privacy; Anonymization; Encoding; System logs; Data quality; Size reduction; Performance improvement
Abstract
System logs constitute valuable information for analysis and diagnosis of systems behavior. The analysis is highly time-consuming for large log volumes. For many parallel computing centers, outsourcing the analysis of system logs (syslogs) to third parties is the only option. Therefore, a general analysis and diagnosis solution is needed. Such a solution is possible only through the syslog analysis from multiple computing systems. The data within syslogs can be sensitive, thus obstructing the sharing of syslogs across institutions, third-party entities, or in the public domain. This work proposes a new method for the anonymization of syslogs that employs de-identification and encoding to provide fully shareable system logs. In addition to eliminating the sensitive data within the test logs, the proposed anonymization method provides 25% performance improvement in post-processing of the anonymized syslogs, and more than 80% reduction in their required storage space.