python_kmp: String Pattern Matching Algorithm (KMP Implemented in Python) - eolink


Reference:

To understand the algorithm, it helps to take a pen and trace the KMP matcher by hand through several different cases.
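A hand trace can be checked against a small program. The sketch below is not the article's code: `prefix_table` and `trace_kmp` are hypothetical helper names introduced only for illustration, and treating an empty pattern as "no steps" is an assumption. It records every comparison the matcher makes so the list can be compared line by line with a drawing on paper.

```python
def prefix_table(pattern):
    """For each prefix, the length of its longest proper prefix that is also a suffix."""
    table = [0] * len(pattern)
    k = 0
    for i in range(1, len(pattern)):
        # Fall back to shorter candidates until one can be extended.
        while k and pattern[i] != pattern[k]:
            k = table[k - 1]
        if pattern[i] == pattern[k]:
            k += 1
        table[i] = k
    return table


def trace_kmp(text, pattern):
    """Return the matcher's steps as (text_index, pattern_index, action) tuples."""
    if not pattern:
        return []  # assumption: an empty pattern produces no steps
    table = prefix_table(pattern)
    steps = []
    i = j = 0
    while i < len(text):
        if text[i] == pattern[j]:
            steps.append((i, j, "match"))
            i += 1
            j += 1
            if j == len(pattern):
                # Record the 0-based start of the complete match.
                steps.append((i - j, j, "found"))
                j = table[j - 1]
        elif j:
            steps.append((i, j, "fall back"))
            j = table[j - 1]
        else:
            steps.append((i, j, "skip"))
            i += 1
    return steps


for step in trace_kmp("abababc", "ababc"):
    print(step)
```

Each "fall back" line is the point where the naive method would restart from the next text character, while KMP keeps the text index fixed and only moves the pattern index.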

Code

The auxiliary function that precomputes the information about the pattern

def pre_calculate_next_list(pattern):
    """Compute the fallback table (the "next" list) for a pattern.

    The table depends only on the pattern, not on the string being searched.
    Element i is the length of the longest proper prefix of pattern[:i + 1]
    that is also a suffix of it. After a mismatch, index the table with
    (number of matched characters - 1); the value is the index of the next
    pattern character to compare, which speeds up matching because the
    string is never rescanned.

    Args:
        pattern (str): pattern string

    Returns:
        list: the fallback positions to continue from after a mismatch
    """
    len_list = len(pattern)
    # Lengths refer to proper (strict) prefixes and suffixes only: the
    # substring itself is never counted.
    # next_list[0] covers the case where only one character has matched,
    # next_list[1] the case where two have matched, and so on.
    next_list = [0]
    # Index of the character being added to extend the current prefix,
    # i.e. the successor of the last matched character.
    the_adding_char_index = 1
    # Length of the common prefix/suffix found so far, always counted from
    # pattern[0]. It doubles as the index of the prefix character that
    # pattern[the_adding_char_index] must equal.
    now_length = 0
    # The loop computes next_list[1] and every later entry.
    while the_adding_char_index < len_list:
        if pattern[now_length] == pattern[the_adding_char_index]:
            # Lucky case: the new character extends the current prefix,
            # so this entry is known and can be recorded.
            now_length += 1
            the_adding_char_index += 1
            next_list.append(now_length)
        elif now_length:
            # Mismatch with now_length >= 1: fall back to the next shorter
            # candidate, next_list[now_length - 1], and retry. This reduces
            # the problem to a smaller instance of the lucky case and is
            # the essential step of the KMP algorithm.
            now_length = next_list[now_length - 1]
        else:
            # Mismatch with nothing matched: record 0 explicitly.
            next_list.append(0)
            the_adding_char_index += 1
    return next_list
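The table can be checked against its definition directly. The sketch below assumes the definition used in the docstring (longest proper prefix that is also a suffix, computed for every prefix of the pattern). `naive_next_list` is a hypothetical name, and the function is deliberately slow: it tries every candidate length instead of reusing earlier entries.

```python
def naive_next_list(pattern):
    """Brute-force table: for each prefix, try every proper border length."""
    result = []
    for end in range(1, len(pattern) + 1):
        prefix = pattern[:end]
        best = 0
        # "Proper" means strictly shorter than the prefix itself.
        for length in range(1, end):
            if prefix[:length] == prefix[-length:]:
                best = length  # lengths increase, so the last hit is the longest
        result.append(best)
    return result


print(naive_next_list("ababaca"))  # [0, 0, 1, 2, 3, 0, 1]
print(naive_next_list("aaaa"))     # [0, 1, 2, 3]
```

For any non-empty pattern, comparing this output with the result of `pre_calculate_next_list` is a quick way to confirm that the fallback step `now_length = next_list[now_length - 1]` finds the same lengths without the repeated comparisons.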

kmp(string,pattern)

def kmp(string, pattern):
    s = 0  # current index into string
    # Index into pattern at which matching continues.
    position_to_continue = 0
    next_list = pre_calculate_next_list(pattern)
    while s < len(string):
        if string[s] == pattern[position_to_continue]:
            # The two characters are identical: advance both indices,
            # exactly as the brute-force (naive) method does.
            s += 1
            position_to_continue += 1
        elif position_to_continue:
            # Mismatch after at least one matched character: use next_list
            # to find where in the pattern to resume. next_list is indexed
            # from 0, and position_to_continue - 1 >= 0 holds here.
            position_to_continue = next_list[position_to_continue - 1]
        else:
            # Mismatch at the first pattern character: move on in the
            # string and leave position_to_continue unchanged.
            s += 1
        # A complete match: print its start position in the string
        # (counted from 1).
        if position_to_continue == len(pattern):
            print(s - position_to_continue + 1)
            position_to_continue = next_list[position_to_continue - 1]


string = "teababaca_aaaeeaae"
# pattern = "ea"
pattern1 = "eea"
# pattern="aacaa"
# pattern="aadabaadaadaa"
# pattern = "acbabaca"
pattern2 = "ababaca"
# print(pre_calculate_next_list(pattern1))
# print(kmp())
kmp(string, pattern1)  # prints 14
kmp(string, pattern2)  # prints 3
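Printing positions makes the matcher hard to test. The sketch below is a variant, not the article's function: `kmp_find_all` and `find_all_builtin` are hypothetical names, the indices are 0-based rather than counted from 1, and returning an empty list for an empty pattern is an assumption. It collects the match positions in a list so they can be compared with Python's built-in `str.find`.

```python
def kmp_find_all(text, pattern):
    """Return the 0-based start index of every match, overlaps included."""
    if not pattern:
        return []  # assumption: an empty pattern is treated as "no matches"
    # Build the fallback table for the pattern.
    table = [0] * len(pattern)
    k = 0
    for i in range(1, len(pattern)):
        while k and pattern[i] != pattern[k]:
            k = table[k - 1]
        if pattern[i] == pattern[k]:
            k += 1
        table[i] = k

    matches = []
    j = 0  # number of pattern characters currently matched
    for i, ch in enumerate(text):
        while j and ch != pattern[j]:
            j = table[j - 1]
        if ch == pattern[j]:
            j += 1
        if j == len(pattern):
            matches.append(i - j + 1)
            j = table[j - 1]  # keep going so overlapping matches are found
    return matches


def find_all_builtin(text, pattern):
    """Reference answer using str.find, advancing one character at a time."""
    found = []
    start = text.find(pattern)
    while start != -1:
        found.append(start)
        start = text.find(pattern, start + 1)
    return found


text = "teababaca_aaaeeaae"
print(kmp_find_all(text, "eea"))      # [13]
print(kmp_find_all(text, "ababaca"))  # [2]
```

Adding 1 to each index gives the positions that `kmp` prints for the same inputs.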


327 2022-08-30
