Sjclust: Towards a framework for integrating similarity join algorithms and clustering