Back to Question Center
0

Uchwepheshe we-Semalt uchaza ukuthi ungayifaka kanjani iwebhusayithi ngeSobho elihle

1 answers:

Kukhona idatha eminingi evame ukwelinye icala i-HTML. Kumshini wekhompiyutha, ikhasi lewebhu lingumxube wezimpawu, izinhlamvu zombhalo, kanye nendawo emhlophe. Into yangempela esiya kuyo ukuthola ikhasi lewebhu kuphela okuqukethwe ngendlela efundwayo kithi. Ikhompiyutha ichaza lezi zakhi njengamathegi we-HTML. Isici esihlukanisa ikhodi eluhlaza kusuka kwedatha esiyibonayo isofthiwe, kulokhu, iziphequluli zethu. Amanye amawebhusayithi afana ne-scrapers angasebenzisa lo mqondo ukuze ahlaziye okuqukethwe kwewebhu futhi alondoloze ukuze asetshenziswe kamuva.

Ngolimi olucacile, uma uvula idokhumenti ye-HTML noma ifayela lomthombo wekhasi elithile, kuzotholakala ukuthola okuqukethwe okwamanje kuleyo website ethize. Lolu lwazi luzoba endaweni ephathekayo kanye nekhodi eminingi. Inqubo yonke ihilela ukusebenzisana nokuqukethwe ngendlela engahlelekile. Noma kunjalo, kungenzeka ukwazi ukuhlela lolu lwazi ngendlela ehlelekile futhi uthole izingxenye eziwusizo kuzo zonke ikhodi.

Ezimweni eziningi, ama-scrapers awenzi umsebenzi wawo ukuze afinyelele i-string ye-HTML. Ngokuvamile kunenzuzo yokugcina wonke umuntu ozama ukufinyelela. Ngokwesibonelo, abantu abenza imisebenzi ethile yokuthengisa ye-intanethi bangase badinge ukufaka izintambo eziyingqayizivele njengomyalo-f ukuthola ulwazi kusuka kukhasi lewebhu. Ukuqedela lo msebenzi emakhasini amaningi, ungadinga usizo hhayi nje amandla omuntu. Ama-web scrapers yiwona ma-bots angakwazi ukufaka iwebhusayithi nge amakhasi angaphezu kwezigidi esikhathini samahora. Inqubo yonke idinga indlela elula yohlelo. Ngezinye izilimi zokuhlela njenge-Python, abasebenzisi bangakwazi ukukhokha abanye abaqhamukayo abangakwazi ukufaka idatha yewebhusayithi bese bayilahla endaweni ethile.

Ukuqothula kungase kube yindlela enobungozi kumawebhusayithi athile. Kukhona izinto eziningi ezikhathazayo ezihambisana nokukhipha umthetho ngokusemthethweni. Okokuqala, abanye abantu bacabanga ukuthi idatha yabo iyimfihlo futhi iyimfihlo. Lokhu kwenzeka ukuthi izinkinga ze-copyright, kanye nokuvuza kokuqukethwe okwehlukile, kungenzeka uma kwenzeka ukukhishwa. Kwezinye izimo, abantu balayisha i-website yonke yokusebenzisa ungaxhunyiwe ku-intanethi. Ngokwesibonelo, esikhathini esedlule, kuvele icala le-Craigslist yewebhusayithi ebizwa nge-3Taps. Leli sayithi lisaba okuqukethwe kwewebhsayithi nokuphinda kubhale kabusha uhlu lwezindlu ezigabeni ezihlukanisiwe. Kamuva bahlala ne-3Taps bekhokha u- $ 1,000,000 kumawebhusayithi abo wangaphambili.

BS iyisethi yamathuluzi (ulimi lwe-Python) njengemoduli noma iphakheji. Ungasebenzisa isobho esihle ukukhipha iwebhusayithi kusuka kumakhasi wedatha kuwebhu. Kungenzeka ukuthi uhlakaze isayithi bese uthola idatha efomini ehlelekile efana nomkhiqizo wakho. Ungadlulisa i-URL bese usetha iphethini ethize kubandakanya ifomethi yethu yokuthumela. Ku-BS, ungathumela kumafomethi ahlukahlukene njenge-XML. Ukuze uqalise, udinga ukufaka inguqulo ehloniphekile ye-BS futhi uqale ngezingqinamba ezimbalwa ze-Python. Ulwazi lokuhlela lubalulekile lapha.

December 7, 2017
Uchwepheshe we-Semalt uchaza ukuthi ungayifaka kanjani iwebhusayithi ngeSobho elihle
Reply