169 похожих чатов

Good evening, colleagues! I need help with own technical question as: What

technologies can be used for parser, susceptible to any variation in website design (DOM)?

Today this question will seem strange to someone, but believe me, tomorrow it will be irreplaceable thing, if you support to beat it🔥

I open to discuss.

Thank you for the attention.

28 ответов

66 просмотров

You can use liberties like bs4 & scrapy for the same

Malware ( DM = BLOCK )
You can use liberties like bs4 & scrapy for the sa...

They're asking for a solution so that they don't have to rewrite their scraper when the website changes.

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
Malware ( DM = BLOCK )
You can use liberties like bs4 & scrapy for the sa...

Does it work even if the whole design of the website will change, am I correctly understand your point here?

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
Link 🪈
If the whole site changes, then no.

I need something that will work even the whole website will change the design. Selenium is working only with specific design version.

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
Malware ( DM = BLOCK )
Idk if that's even possible

It should be possible, at least, with NLP

+ might be illegal for some spesific websites

Malware ( DM = BLOCK )
Tine to use or train a LLM to do it for you

Why llm? There surely must be another type of model more applicable to the use case?

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса

I bring them new clients, and twice as many, I don’t see a problem with unhappy requests to the site, it sounds funny compared to how many new clients would come to them.

ᅠ ᅠ ᅠ Maksym R.
I bring them new clients, and twice as many, I don...

A) read what they said again B) https://t.me/thedevs_chat/636436

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
Malware ( DM = BLOCK )
A) read what they said again B) https://t.me/thed...

If for such sites it is such a big money for sent requests, why do they position themselves as large sites with serious turnover, judging by what is written about them? Or are they so broke that they can't afford new clients? Do you believe this yourself?

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
Link 🪈
Can you guarantee it?

Of course, I worked with clients from Dubai, I know what I'm saying

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
Malware ( DM = BLOCK )
Tine to use or train a LLM to do it for you

You can check easily, what are they providing on international level, as I see they are online retailer specializing in consumer electronics, gadgets, and men's fashion. The company provides products in various other categories like home & garden, bags, baby & kids, health & beauty and more from brands like Apepal, Zanflare, Utorch, Apple and more, and so on? So they can't find money on requests in exchange for new clients, right?

ᅠ ᅠ ᅠ Maksym R.
Of course, I worked with clients from Dubai, I kno...

But those sites don't know it. And if you can prove what you say to them, it would be better to get to an agreement with the sites to get the data directly from them without scraping.

ᅠ ᅠ ᅠ Maksym R.
You can check easily, what are they providing on i...

Imagine if multiple scrappers make multiple requests to the site, it can add up quickly. And as I said before, those site owners can't know for sure if what you say is true.

I got a few things to say about this question, too… for one, a link to quora where you asked the same thing doesn't count as research, and I'd have to check SO guidelines, but I think they don't like when people use chatgpt or similar as the main/only source.

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
Link 🪈
I got a few things to say about this question, too...

Research is to show the amount of tools, which could be used. Does it makes sense to show each website of these tools if I provide the list of it?

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса
Link 🪈
But those sites don't know it. And if you can prov...

I prove when they received money from the first client I received % from it, so I think this is a clear proof.

ᅠ ᅠ ᅠ Maksym-R. Автор вопроса

Похожие вопросы

Обсуждают сегодня

Господа, а что сейчас вообще с рынком труда на делфи происходит? Какова ситуация?
Rꙮman Yankꙮvsky
29
А вообще, что может смущать в самой Julia - бы сказал, что нет единого стандартного подхода по многим моментам, поэтому многое выглядит как "хаки" и произвол. Короче говоря, с...
Viktor G.
2
30500 за редактор? )
Владимир
47
а через ESC-код ?
Alexey Kulakov
29
Чёт не понял, я ж правильной функцией воспользовался чтобы вывести отладочную информацию? но что-то она не ловится
notme
18
У меня есть функция где происходит это: write_bit(buffer, 1); write_bit(buffer, 0); write_bit(buffer, 1); write_bit(buffer, 1); write_bit(buffer, 1); w...
~
14
Добрый день! Скажите пожалуйста, а какие программы вы бы рекомендовали написать для того, чтобы научиться управлять памятью? Можно написать динамический массив, можно связный ...
Филипп
7
Недавно Google Project Zero нашёл багу в SQLite с помощью LLM, о чём достаточно было шумно в определённых интернетах, которые сопровождались рассказами, что скоро всех "ибешни...
Alex Sherbakov
5
Ребят в СИ можно реализовать ООП?
Николай
33
https://github.com/erlang/otp/blob/OTP-27.1/lib/kernel/src/logger_h_common.erl#L174 https://github.com/erlang/otp/blob/OTP-27.1/lib/kernel/src/logger_olp.erl#L76 15 лет назад...
Maksim Lapshin
20
Карта сайта