169 похожих чатов

Good evening, colleagues! I need help with own technical question as: What

technologies can be used for parser, susceptible to any variation in website design (DOM)?

Today this question will seem strange to someone, but believe me, tomorrow it will be irreplaceable thing, if you support to beat it🔥

I open to discuss.

Thank you for the attention.

28 ответов

41 просмотр

You can use liberties like bs4 & scrapy for the same

Malware (\/ /\ R |_| |\|)
You can use liberties like bs4 & scrapy for the sa...

They're asking for a solution so that they don't have to rewrite their scraper when the website changes.

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
Malware (\/ /\ R |_| |\|)
You can use liberties like bs4 & scrapy for the sa...

Does it work even if the whole design of the website will change, am I correctly understand your point here?

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
harꭑony5 (⊙ ◡ ⦿︎) ↺
If the whole site changes, then no.

I need something that will work even the whole website will change the design. Selenium is working only with specific design version.

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
Malware (\/ /\ R |_| |\|)
Idk if that's even possible

It should be possible, at least, with NLP

+ might be illegal for some spesific websites

Malware (\/ /\ R |_| |\|)
Tine to use or train a LLM to do it for you

Why llm? There surely must be another type of model more applicable to the use case?

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса

I bring them new clients, and twice as many, I don’t see a problem with unhappy requests to the site, it sounds funny compared to how many new clients would come to them.

ᅠ ᅠ ᅠ Maksym R.
I bring them new clients, and twice as many, I don...

A) read what they said again B) https://t.me/thedevs_chat/636436

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
Malware (\/ /\ R |_| |\|)
A) read what they said again B) https://t.me/thed...

If for such sites it is such a big money for sent requests, why do they position themselves as large sites with serious turnover, judging by what is written about them? Or are they so broke that they can't afford new clients? Do you believe this yourself?

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
harꭑony5 (⊙ ◡ ⦿︎) ↺
Can you guarantee it?

Of course, I worked with clients from Dubai, I know what I'm saying

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
Malware (\/ /\ R |_| |\|)
Tine to use or train a LLM to do it for you

You can check easily, what are they providing on international level, as I see they are online retailer specializing in consumer electronics, gadgets, and men's fashion. The company provides products in various other categories like home & garden, bags, baby & kids, health & beauty and more from brands like Apepal, Zanflare, Utorch, Apple and more, and so on? So they can't find money on requests in exchange for new clients, right?

ᅠ ᅠ ᅠ Maksym R.
Of course, I worked with clients from Dubai, I kno...

But those sites don't know it. And if you can prove what you say to them, it would be better to get to an agreement with the sites to get the data directly from them without scraping.

ᅠ ᅠ ᅠ Maksym R.
You can check easily, what are they providing on i...

Imagine if multiple scrappers make multiple requests to the site, it can add up quickly. And as I said before, those site owners can't know for sure if what you say is true.

I got a few things to say about this question, too… for one, a link to quora where you asked the same thing doesn't count as research, and I'd have to check SO guidelines, but I think they don't like when people use chatgpt or similar as the main/only source.

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
harꭑony5 (⊙ ◡ ⦿︎) ↺
I got a few things to say about this question, too...

Research is to show the amount of tools, which could be used. Does it makes sense to show each website of these tools if I provide the list of it?

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
harꭑony5 (⊙ ◡ ⦿︎) ↺
But those sites don't know it. And if you can prov...

I prove when they received money from the first client I received % from it, so I think this is a clear proof.

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса

Похожие вопросы

Обсуждают сегодня

Такс, блин, таки кто-то знает, каким образом работают макросы stdin/stdout/stderr? Я влез в stdio.h, там определения нет, отладил через асмокод - вызывается функция со странны...
The Bird of Hermes
18
Всем привет, на линуксе лучше на fasm или nasm учиться писать для начала ?
meszjol
14
я не магистр хаскеля, но разве не может лейзи тип конвертнуться в не-лейзи запросив вычисление содержимого прям при инициализации?
deadgnom32 λ madao
100
Если у меня есть такой класс: Object = {} function Object:new(a_name, a_transform, a_color, a_mesh, a_material, a_shader, a_textures) local private = {} private.n...
Cuarno Vile
4
было так ;void set_http_ver(RESPD* ptr, char* version, uint32_t length) // example: 'RTSP/1.1 ' set_http_ver: mov eax, [esp + 4] mov ecx, [esp + 8] ...
Mixail Frolov
5
А еще в перле можно уже @arr1 + @arr2?
Sergei Zhmylove
53
зачем же переименовывать ? чтобы кол-во участников возросло или вдруг IBM от этого снова на свифте начнет кодить ? Я не понимаю что страшного в том что свифт гавно, если это т...
Oleh Nerzh
10
@MrMiscipitlick А можешь макрос написать, который будет вычислять смещение относительно переданных меток? Просто .label1-.label2, и вернуть значение.
КТ315
35
здравствуйте. совершаю вот такую вещь: strcpy(line, (char)current_number); где current number — неподписанный шорт, line — массив чаров. ругань следующая: main.c:29:30: error...
Roberto's Ширгозиев
13
Где закоментить или что то прописать?
Alibek Кulseitov 🇰🇿
7
Карта сайта