A novel approach to generate a large scale of supervised data for short text sentiment analysis (original) (raw)