Good evening, colleagues! I need help with a technical question: what technologies can be used to build a parser that is resilient to any variation in website design (DOM)?

Today this question may seem strange to some, but believe me, tomorrow it will be an irreplaceable thing if you help me crack it🔥

I'm open to discussion.

Thank you for your attention.

28 replies

63 views

You can use libraries like bs4 & scrapy for this

Malware ( DM = BLOCK )
You can use libraries like bs4 & scrapy for the sa...

They're asking for a solution so that they don't have to rewrite their scraper when the website changes.

ᅠ ᅠ ᅠ Maksym-R. Question author
Malware ( DM = BLOCK )
You can use libraries like bs4 & scrapy for the sa...

Does it work even if the whole design of the website changes? Am I understanding your point correctly?

ᅠ ᅠ ᅠ Maksym-R. Question author
Bread pup ▲⬤ ×▫︎
If the whole site changes, then no.

I need something that will work even if the whole website changes its design. Selenium only works with a specific version of the design.
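
One way to approach the requirement above, as a minimal sketch rather than a definitive answer: ignore the tag structure entirely and match on the visible text of the page. The sketch uses only the Python standard library. The `extract_price` helper, the price pattern, and both sample pages are hypothetical and would need tuning for a real site.

```python
import re
from html.parser import HTMLParser


class VisibleText(HTMLParser):
    """Collects text nodes, skipping <script> and <style> contents."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style blocks.
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


# Hypothetical pattern: a currency sign followed by an amount.
PRICE_RE = re.compile(r"[$€£]\s?\d+(?:[.,]\d{2})?")


def extract_price(html):
    """Return the first price-like string in the page text, or None."""
    parser = VisibleText()
    parser.feed(html)
    parser.close()
    match = PRICE_RE.search(" ".join(parser.chunks))
    return match.group(0) if match else None


# Two hypothetical layouts of the same product page.
old_design = '<div class="price"><span>$19.99</span></div>'
new_design = '<section><p data-role="cost">Now only <b>$19.99</b>!</p></section>'

print(extract_price(old_design))
print(extract_price(new_design))
```

Both layouts go through the same code path because no CSS selector or XPath is involved. This is only a heuristic: it breaks when the text itself changes or when several prices appear on the page, which is where the NLP or model-based ideas raised in this thread would come in.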

ᅠ ᅠ ᅠ Maksym-R. Question author
Malware ( DM = BLOCK )
Idk if that's even possible

It should be possible, at least, with NLP

+ might be illegal for some specific websites

Malware ( DM = BLOCK )
Time to use or train an LLM to do it for you

Why llm? There surely must be another type of model more applicable to the use case?

ᅠ ᅠ ᅠ Maksym-R. Question author

I bring them new clients, twice as many. I don't see a problem with some unwelcome requests to the site; it sounds funny compared to how many new clients would come to them.

ᅠ ᅠ ᅠ Maksym R.
I bring them new clients, and twice as many, I don...

A) read what they said again B) https://t.me/thedevs_chat/636436

ᅠ ᅠ ᅠ Maksym-R. Question author
Malware ( DM = BLOCK )
A) read what they said again B) https://t.me/thed...

If serving those requests costs such sites so much money, why do they position themselves as large sites with serious turnover, judging by what is written about them? Or are they so broke that they can't afford new clients? Do you believe this yourself?

ᅠ ᅠ ᅠ Maksym-R. Question author
Bread pup ▲⬤ ×▫︎
Can you guarantee it?

Of course. I've worked with clients from Dubai; I know what I'm talking about.

ᅠ ᅠ ᅠ Maksym-R. Question author
Malware ( DM = BLOCK )
Time to use or train an LLM to do it for you

You can easily check what they offer at the international level. As I see it, they are an online retailer specializing in consumer electronics, gadgets, and men's fashion. The company sells products in various other categories such as home & garden, bags, baby & kids, health & beauty and more, from brands like Apepal, Zanflare, Utorch, Apple and others. So they can't find the money to serve requests in exchange for new clients, right?

ᅠ ᅠ ᅠ Maksym R.
Of course, I worked with clients from Dubai, I kno...

But those sites don't know it. And if you can prove what you say to them, it would be better to get to an agreement with the sites to get the data directly from them without scraping.

ᅠ ᅠ ᅠ Maksym R.
You can check easily, what are they providing on i...

Imagine if multiple scrapers make multiple requests to the site; it can add up quickly. And as I said before, those site owners can't know for sure if what you say is true.

I got a few things to say about this question, too… for one, a link to Quora where you asked the same thing doesn't count as research, and I'd have to check SO guidelines, but I think they don't like when people use ChatGPT or similar as the main/only source.

ᅠ ᅠ ᅠ Maksym-R. Question author
Bread pup ▲⬤ ×▫︎
I got a few things to say about this question, too...

The research is there to show the number of tools that could be used. Does it make sense to link each tool's website if I already provide the list of them?

ᅠ ᅠ ᅠ Maksym-R. Question author
Bread pup ▲⬤ ×▫︎
But those sites don't know it. And if you can prov...

I proved it: when they received money from the first client, I received a percentage of it, so I think that is clear proof.
