Python3實現(xiàn)的簡單驗證碼識別功能示例
本文實例講述了Python3實現(xiàn)的簡單驗證碼識別功能。分享給大家供大家參考,具體如下:
這次的需求是自動登錄某機構(gòu)網(wǎng)站, 其驗證碼很具特色, 很適合做驗證碼識別入門demo, 先貼主要代碼, 其中圖片對比使用了編輯距離算法, 腳本使用了pillow庫
from PIL import Image
import requests
import re
splitter = re.compile(r'\d{30}') # 分割二值化后的圖片
# distance('11110000', '00000000')
# 比較兩個字符串有多少位不同, 返回不同的位數(shù)
def distance(string1, string2):
d_str1 = len(string1)
d_str2 = len(string2)
d_arr = [[0] * d_str2 for i in range(d_str1)]
for i in range(d_str1):
for j in range(d_str2):
if string1[i] == string2[j]:
if i == 0 and j == 0:
d_arr[i][j] = 0
elif i != 0 and j == 0:
d_arr[i][j] = d_arr[i - 1][j]
elif i == 0 and j != 0:
d_arr[i][j] = d_arr[i][j - 1]
else:
d_arr[i][j] = d_arr[i - 1][j - 1]
else:
if i == 0 and j == 0:
d_arr[i][j] = 1
elif i != 0 and j == 0:
d_arr[i][j] = d_arr[i - 1][j] + 1
elif i == 0 and j != 0:
d_arr[i][j] = d_arr[i][j - 1] + 1
else:
d_arr[i][j] = min(d_arr[i][j - 1], d_arr[i - 1][j], d_arr[i - 1][j - 1]) + 1
current = max(d_arr[d_str1 - 1][d_str2 - 1], abs(d_str2 - d_str1))
# print("Levenshtein Distance is",current)
# print(current)
return current
# 去除字符串里面連續(xù)的1
def no_one(string):
n_arr = splitter.findall(string)
n_arr = filter(lambda each_str: each_str != '111111111111111111111111111111', n_arr)
n_result = ''
for n_each in n_arr:
n_result += str(n_each)
return n_result
opener = requests.session()
res = opener.get('http://60.211.254.236:8402/Ajax/ValidCodeImg.ashx').content
with open('verify.gif', 'wb') as v:
v.write(res)
img = Image.open('verify.gif')
img = img.convert('L')
size = img.size
# img = img.point(table, '1')
img_arr = img.load()
# for x in range(size[0]):
# for y in range(size[1]):
# if img_arr[x, y] > 210:
# img_arr[x, y] = 1
# else:
# img_arr[x, y] = 0
# img.save('after.gif')
inc = 0
str1 = ''
str2 = ''
str3 = ''
cur_str = ''
for x in range(size[0]):
for y in range(size[1]):
if img_arr[x, y] > 210:
cur_str += '1'
else:
cur_str += '0'
# print(img_arr[i, j], end='')
# cur_str += str(img_arr[x, y])
inc += 1
# if inc % 18 == 0:
# print('\n----')
# else:
# print('')
if inc == 18:
str1 = cur_str
cur_str = ''
elif inc == 36:
str2 = cur_str
cur_str = ''
elif inc == 54:
str3 = cur_str
cur_str = ''
str1 = str1[:-60]
str2 = str2[:-60]
str3 = str3[:-60]
str1 = no_one(str1)
str2 = no_one(str2)
str3 = no_one(str3)
str1 = str1.strip('1')
str2 = str2.strip('1')
str3 = str3.strip('1')
# print(str1)
# print(str3)
with open('./dict/plus') as plus:
with open('./dict/minus') as minus:
p = plus.read()
m = minus.read()
is_add = 1 if distance(p, str2) < distance(m, str2) else 0
arr1 = []
arr3 = []
for each in range(1, 10):
with open('./dict/{}'.format(each)) as f:
ff = f.read()
arr1.append([each, distance(ff, str1)])
arr3.append([each, distance(ff, str3)])
arr1 = sorted(arr1, key=lambda item: item[1])
arr3 = sorted(arr3, key=lambda item: item[1])
result = arr1[0][0] + arr3[0][0] if is_add else arr1[0][0] - arr3[0][0]
print(result)
# login_url = 'http://60.211.254.236:8402/Ajax/Login.ashx?Method=G3_Login'
# login_data = {
# 'loginname': usn,
# 'password': pwd,
# 'validcode': result,
#
# }
# opener.get(login_url, login_data)
字庫已經(jīng)部署到GitHub地址:https://github.com/hldh214/validCode/
更多關(guān)于Python相關(guān)內(nèi)容感興趣的讀者可查看本站專題:《Python圖片操作技巧總結(jié)》、《Python數(shù)據(jù)結(jié)構(gòu)與算法教程》、《Python Socket編程技巧總結(jié)》、《Python函數(shù)使用技巧總結(jié)》、《Python字符串操作技巧匯總》、《Python入門與進階經(jīng)典教程》及《Python文件與目錄操作技巧匯總》
希望本文所述對大家Python程序設(shè)計有所幫助。
相關(guān)文章
python實現(xiàn)數(shù)通設(shè)備tftp備份配置文件示例
這篇文章主要介紹了python實現(xiàn)數(shù)通設(shè)備tftp備份配置文件示例,需要的朋友可以參考下2014-04-04
解讀torch.cuda.amp自動混合精度訓(xùn)練之節(jié)省顯存并加快推理速度
這篇文章主要介紹了torch.cuda.amp自動混合精度訓(xùn)練之節(jié)省顯存并加快推理速度問題,具有很好的參考價值,希望對大家有所幫助,如有錯誤或未考慮完全的地方,望不吝賜教2023-08-08
python?配置uwsgi?啟動Django框架的詳細教程
這篇文章主要介紹了python?配置uwsgi?啟動Django框架,本文給大家講解的非常詳細,對大家的學(xué)習(xí)或工作具有一定的參考借鑒價值,需要的朋友可以參考下2022-12-12

