Similar Items: Spark-based DBSCAN++ for efficient density-based clustering