Abstract
Text based CAPTCHAs are ubiquitous on the Internet since they are easily generated by machines, easily solvable by humans and yet not easily defeated by state-of-the-art computer algorithms. Over the years, several attacks have been designed by researchers to solve different types of CAPTCHAs. These attacks always assume that the type of CAPTCHA is known. However, in order to devise a common frame work, comprising of different attacks that can be launched automatically, the first prime step is to recognize the CAPTCHA scheme. In this paper we present a method based on geometric features to automatically identify text based CAPTCHA schemes. The proposed method is verified on a data set comprising of 25 different types of CAPTCHA (1,000 samples per type). We achieve an identification / classification accuracy of up to approximately 99%.