Similar Items: How challenging it is to identify real code authors: an empirical study