Multithreaded scraping of web page titles into Excel with Python - eolink official site

#!/usr/bin/env python
# coding: utf-8

import multiprocessing
import threading

import pandas as pd
import requests
from bs4 import BeautifulSoup

# Read the input workbook; it must contain a "URL" column.
df = pd.read_excel("网站对应名字.xlsx")

sites = df.URL
data_count = len(sites)
thread_count = 16
threads = []
n_loops = range(thread_count)

# One slot per site; each thread fills in only its own indices.
names = [None] * data_count


def get_url_title(site):
    """Return the <title> text of a page, or an error marker on failure."""
    try:
        # The timeout keeps one unresponsive site from blocking a thread forever.
        html = requests.get(site, timeout=10)
        # Name the parser explicitly to avoid bs4's "no parser specified" warning.
        soup = BeautifulSoup(html.content, "html.parser")
        return soup.find("title").text
    except Exception:
        return "网址有误"  # means "invalid URL"


def write_title(start):
    # Uses the module-level sites, names, data_count and thread_count.
    # Start at index `start` and handle every thread_count-th site after it.
    for i in range(start, data_count, thread_count):
        names[i] = get_url_title(sites[i])
        print(i, names[i])


def main():
    for i in n_loops:
        t = threading.Thread(target=write_title, args=(i,))
        threads.append(t)
    # Start all threads.
    for i in n_loops:
        threads[i].start()
    # Wait for all threads to finish.
    for i in n_loops:
        threads[i].join()


if __name__ == '__main__':
    main()
    # Inspect the results.
    print(names)
    print(len(names))
    df.info()
    print(multiprocessing.cpu_count())
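The script above splits the work by hand: thread `i` handles indices `i`, `i + thread_count`, `i + 2 * thread_count`, and so on, and writes into a shared `names` list. As a possible alternative (not the article's own method), the same result can be sketched with the standard library's `concurrent.futures.ThreadPoolExecutor`, whose `map` returns results in input order. The helper name `fetch_titles`, the stand-in `fake_get_title`, and the example URLs are hypothetical and exist only for illustration; in practice the `get_title` argument would be something like `get_url_title` from the script.

```python
from concurrent.futures import ThreadPoolExecutor


def fetch_titles(sites, get_title, max_workers=16):
    """Return one title per site, in the same order as `sites`.

    `get_title` is any callable that maps a URL to a title string,
    for example the get_url_title function from the script above.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as executor:
        # executor.map yields results in input order, so no shared
        # `names` list or manual index striding is needed.
        return list(executor.map(get_title, sites))


def fake_get_title(site):
    # Hypothetical stand-in for a real HTTP request (demo only).
    return "title of " + site


# Hypothetical example URLs; no network access happens here.
demo_sites = ["http://a.example", "http://b.example", "http://c.example"]
demo_titles = fetch_titles(demo_sites, fake_get_title, max_workers=2)
print(demo_titles)
```

With the article's DataFrame, the result could then be stored and saved with something like `df["name"] = fetch_titles(df.URL, get_url_title)` followed by `df.to_excel("output.xlsx", index=False)`. The column name `name` and the file name `output.xlsx` are assumptions, and writing `.xlsx` files with pandas requires an Excel engine such as openpyxl to be installed.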

409 2022-08-26