fail to crawl a dynamic loading web page using scrapy-splash - web-scraping

Greetings. I am new to scrapy-splash.
i try to extract car information from following webpage by using scrapy-splash
https://www.lexusserramonte.com/new-inventory/index.htm
i tried a lot, just does not work. Make no difference compared to just using scrapy without splash plugin.
Just wondering anyone can extract information from that URL? if you can, please share your code.
much much appreciate

You don't need to use splash for this task.. all you need is to reverse engineer API calls.
All the needed info on cars can be found by this link https://www.lexusserramonte.com/apis/widget/INVENTORY_LISTING_DEFAULT_AUTO_NEW:inventory-data-bus1/getInventory?compositeType=new

Related

How to solve PageErrors when using pyppeteer for web scraping?

I'm trying to scrape data from this site https://b2b.10086.cn/b2b/main/listVendorNotice.html?noticeType=2 using pyppeteer. I wrote a for-loop running through each page and used "await page.click" to change pages. However, the script could only work for a few pages and then encountered "pyppeteer.errors.PageError: No node found for selector". Does anyone have any ideas on how to solve this problem? I have tried adding "await asyncio.sleep(1)" and user agent information, which did not work.
Thank you for your help in advance!

Linkedin sharing page keeps spinning

I’m trying to use the shareArticle API on a website for a client, whatever parameters I’m giving the share page keeps spinning, it seems to be a parsing issue but I’m not able to figure it out.I’ve also tried to use the share button API but it’s not better.
This is the URL I’m trying to share (but every URL from the site seems to be having the same issue) : http://www.urbanem.fr/formation-gestion-conflits-mediation-2/
Any idea what might be wrong?
Here’s the source code to generate the share button:
$postURLLinkedin = urlencode(home_url() . "?p=" . get_the_ID());
$linkedin_share = '<li><a class="sb-linkedin" href="http://www.linkedin.com/shareArticle?mini=true&url='.$postURLLinkedin.'&title='.urlencode($postTitle).'&source='.urlencode(home_url()).'&summary='.urlencode($postSummary).'" onclick="javascript:window.open(this.href,\'\', \'menubar=no,toolbar=no,resizable=yes,scrollbars=yes,height=600,width=600\');return false;" title="Partager sur Linkedin"><i class="fa fa-linkedin"></i></a></li>';
If you load the url in your linkedin profile and then make sharing works again. It's not the right thing to do but I is saving me yet.
We need a fix for this, as I pass me the same. The link you want to share are created dynamic. The share is loaded and never get to share. With all browsers I have the same problem.
I just checked and its working fine. Though I didnt share it, but I was at the last step of sharing.

fill out search on website and screen scrape result in r

this is my first post, so if my question is too vague or not clear, please tell me so.
I'm trying to scrape a website with news-articles for a research project. But the link to the modified search on that webpage won't work, because the intranet-authentication will spit out an error.
So my idea was, that I fill out the search form and use the resulting link to scrape the website.
Since my boss likes to work with R, he would like me to write an R-skript to do so, but I have no idea how to and haven't found anything working.
You need two packages: RCurl and XML.
The RCurl package is used for internet browsing. It can access HTML forms with _GET or _PUT arguments. So, with it you can login or fill out the any form.
The output from the server would be in HTML. If you want to grep the links, you can use XLM package. I helps to get any data form XML format.
But before start, you have to find out that is the search form in webpage (and that arguments should be used). The Firefox browser could be useful. You need two add-ins: Live HTTP header and Firebug. With those add-ins you can inspect webpage much more easier.
I know that it did not solve you problem, but I could not say any more, since it deepens on particular situation and webpage structure. I believe that the tool I have mentioned is quite enough to achieve that you want.
Bet regards.

Set a tag to track links from a visual basic app in google analytics

I hope I can explain myself.
See. I have this little program where I put a link to my site, what I want to know if there's some way to add a tag into the URL so Google Analytics can count the amount of visitors coming from that program.
Like when you parse the GET in php.
something like http:\\www.stackoverflow.com\?something_to_google_analytics_to_read
If this is possible, I assume that I need also to configure that Tag into Analytic's, or?
Thanks
As #SLaks pointed, I can find a step by step guide for create what I wanted in:
https://support.google.com/analytics/answer/1033867?ref_topic=1032998
It is called Campaigns.
Thanks

Url Routing/Rewriting in ASP.NET

We want to add some more data tracking to our website and we want to attach a keyword onto the end of a url.
I have something like this
http://www.samplesite.com/t/1/video/123456
http://www.samplesite.com/t/2/video/123457
http://www.samplesite.com/t/3/video/123458
and I want it to look like this
http://www.samplesite.com/t/1/video/123456/funny
http://www.samplesite.com/t/2/video/123457/sports
http://www.samplesite.com/t/3/video/123458/informative
I've been searching the internet trying to figure out how to do this and I haven't really come up with anything. I'm able to paste those urls in and get to the right page but I don't actually know how to make those pages have those urls when they load. I realize this question is pretty open ended but I'm basically looking for some starting points on some more searches as I don't think I've been phrasing things the best. Thanks in advance.
If you're using IIS 7 or 7.5 you can use the Microsoft URL Rewriter module, otherwise there are some third party ones for IIS6 and 5.
http://www.iis.net/downloads/microsoft/url-rewrite
There are several examples at:
http://www.iis.net/learn/extensions/url-rewrite-module/using-url-rewrite-module-20
So are you saying you go to one of those URLs and it shows what you expect, but the URL is changing (so it doesn't have the suffix)?
Have you tried using urlrewriter?
A few references :
http://dotnetguts.blogspot.in/2008/07/url-rewriting-with-urlrewriternet.html
http://www.tutorialized.com/tutorial/URL-Rewriting-in-ASP.NET-using-URLRewriter.Net/38861
http://www.addedbytes.com/articles/for-beginners/url-rewriting-for-beginners/
Hope this helps you.

Resources