Is there a way I can Download images from any search engine with a code like this?

Mangs

0人浏览 · 2022-08-24 17:01:58

Mangs · 2022-08-24 17:01:58 发布

Answer a question

I tried downloading images from bing to a directory but due to some reason the code just executes and gives me nothing.. not even an error.. I used the user-agent HTTP as well.. but it still doesnt seem to be working.. What should i do?

from bs4 import BeautifulSoup
import requests
from PIL import Image
from io import BytesIO

url = 'https://www.bing.com/search'
search = input("Search for: ")
headers = {'User-Agent': 'Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:80.0) Gecko/20100101 
Firefox/80.0'}
params = {"q": search}
r = requests.get(url, headers=headers, params=params)

soup = BeautifulSoup(r.text, "html.parser")
links = soup.findAll("a", {"class": "thumb"})

for item in links:
     img_obj = requests.get(item.attrs["href"])
     print("Getting", item.attrs["href"])
     title = item.attrs["href"].split("/")[-1]
     img = Image.open(BytesIO(img_obj.content))
     img.save("./scraped_images/" + title, img.format)

Answers

To get all images, you need to add /images to the link. Here's an example with modifications to your code:

from bs4 import BeautifulSoup
from PIL import Image
from io import BytesIO
import requests
import json

search = input("Search for: ")

url = "https://www.bing.com/images/search"

headers = {
    "User-Agent": "Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:80.0) Gecko/20100101 Firefox/80.0"
}
params = {"q": search, "form": "HDRSC2", "first": "1", "scenario": "ImageBasicHover"}
r = requests.get(url, headers=headers, params=params)

soup = BeautifulSoup(r.text, "html.parser")
links = soup.find_all("div", {"class": "img_cont hoff"})

for data in soup.find_all("a", {"class": "iusc"}):
    json_data = json.loads(data["m"])
    img_link = json_data["murl"]
    img_object = requests.get(img_link, headers=headers)
    title = img_link.split("/")[-1]

    print("Getting: ", img_link)
    print("Title: ", title + "\n")

    img = Image.open(BytesIO(img_object.content))
    img.save("./scraped_images/" + title)

Python

Python社区为您提供最前沿的新闻资讯和知识内容

更多推荐

求助！为什么用InsCode部署会出现无限重定向？

Python

如何重塑熊猫。系列

问题:如何重塑熊猫。系列在我看来,它就像 pandas.Series 中的一个错误。 a = pd.Series([1,2,3,4]) b = a.reshape(2,2) b b 有类型 Series 但无法显示,最后一条语句给出异常,非常冗长,最后一行是“TypeError: %d format: a number is required, not numpy.ndarray”。 b.sha

Python

在哪里可以找到有关 Keras 中默认权重初始化器的文档? [复制]

问题:在哪里可以找到有关 Keras 中默认权重初始化器的文档? [复制] 我刚刚在这里](https://keras.io/initializers/)中阅读了有关[中的 Keras 权重初始化器的信息。在文档中,只介绍了不同的初始化程序。如: model.add(Dense(64, kernel_initializer='random_normal')) 当我没有指定kernel_initia