Node Alias issue in Google? - drupal

My website pages all show up in Google search results as *.com/node/#### instead of the alias name.
I need help fixing the naming convention to show up in Google via alias, as they are also listed in the sitemap.xml. I'm not sure if this needs to be fixed from Drupal or through NGINX.
Note that I don't have this issue in Bing.

The reason for this can be that Google bots have indexed your urls with node/nid formats.
But its weird that you don't have this issue in Bing. The only reason I can think of for this anamoly is that Bing crawled the right urls just by chance or Bing has a rule which says when there are aliases use more meaningful ones(which can't be true for Bing :P)
Anyways go ahead and install http://drupal.org/project/globalredirect module. This should solve your problems as it creates 301 redirects for all you aliased paths which is very SEO friendly.

It could be that google found your page while you had aliasses disabled, and Bing found it later when the aliasses were enabled.
If you want them removed you can go to google webmaster tools. http://webmaster.google.com
There you can upload a new sitemap and remove URL's that you no longer use.

Related

Does Google Search Console indexing the nonwww version hurt my ranking if my site is using the www version?

Not sure if this is the right forum to post a question like this. My site at https://www.usahazmat.com is setup in Analytics using www and Search Console is setup with a connection to my analytics account. I assumed when my site is indexed, it's indexed using the www version of my domain.
When I use the site command site:www.usahazmat.com, I only see about 1600 pages, but when I do site:usahazmat.com (nonwww), I see 6500 pages.
I have two questions:
Why is search console indexing the nonwww version when everything is setup for www
With Google indexing the nonwww version, is this hurting my ranking? And if so, what would be the best route to fix this?
This covers a few areas.
What you specify in Google Analytics his no effect on ranking. What you see in Google Analytics is based on where you place your tracking code.
What you register with Google Search Console has no effect on ranking. The way you register controls what you see. So if you register the www version, you only see data on that.
It looks like you redirect users to the www version of your site (good). Use that for everything
Did you change from none www to www. That could explain the site: search change. But you will fins better data if you look at your site in the Google Search Console.

How to list all pages in Google search results?

My website was recently hacked and Google mentioned "This site may be hacked" so I removed the entire wordpress website and changed server. I installed a new wordpress website however, Google is still crawling the old pages. It runs into 20+ Google pages meaning there's over 200 links generated by the hack.
Now, I would like to do a 301 Redirect to all this links using htaccess so Google cache faster and remove these links.
How do I list all this links displayed in the searched result? or is there a better way to do? Yes, I have asked Google for Review but they said it will take several weeks :(
You can see all youre indexed pages by writing this in Google search
site:example.com
Just write youre domain name instead of example.com

Google listed a blog post with https and I don't know why?

Two days ago we posted a new blog on a site with the aim of being picked up for the search term "live comedy in chippenham". It’s been indexed by Google and we’re now 2nd in the results for the search query. The bad news is that for some reason the post has been indexed as a https URL so all browsers give a warning when the link is clicked.
Firefox gives this error:
The owner of www.neeld.co.uk has configured their website improperly. To protect your information from being stolen, Firefox has not connected to this website.
The host has confirmed that it's not a server config error and we have other posts and pages on the site that are being indexed correctly. We're using WordPress and the Yoast plugin. I can't see anywhere in Webmaster Tools that could be causing the problem.
Can anyone offer any advice please? If you search Google for "live comedy in chippenham" you'll see the issue (it's the link https://www.neeld.co.uk/live-comedy-in-chippenham/)?
It's a really strange one but something I've experienced before.
It has mostly likely been caused by an external link to the page using https protocol which Google has followed before indexing the page. Google are very keen to index https pages at the moment so we might start seeing this kind of issue more often.
There's not a lot you can do other than wait for Google to realise their mistake and list the correct URL in the SERPS. You can help speed this along with a canonical link (which I can see is there), XML sitemap (which you've got) and a server level redirect of https to http.
Do not try to remove the page in Webmaster Tools as this won't have the desired effect and will stop Google reindexing the page properly.
Hope this helps.

Will a Sitemap in localhost create duplicate content issue?

For my Wordpress.org site I use the Google Sitemap Generator plugin by Arne B. While in localhost I activated the plugin and it works.
I usually update my website in localhost and then upload the database to my webhost. So now I am wondering if Google search results will now enter both urls below?? Reason I am asking is because I am afraid Google will consider this as duplicate content.
http://127.0.0.1/beef-recipe-1/
http://www.actual-website.com/beef-recipe-1/
Google can't access your localhost (127.0.0.1) so it will most likely ignore those URL's.
If you are afraid of the above situation the best thing you can do is to delete all the previous sitemaps and re-generate a new one while your site is online. By going to Google's webmasters tools Resubmit the sitemap if necessary and crawl your websites main domain link for e.g: mydomainname.com and let Google crawl all direct links from associated to the homepage.
This way you will not lose rankings on Google while it may become a helping factor to your website.
Cheers!

Google URL Crawl error 404 - domain appending to end of URL

I recently built and published my Wordpress site at www.kernelops.com and submitted it to the google index and webmaster tools. Today I logged into webmaster tools and found 60 URL errors all with the same type of issue. The base domain address www.kernelops.com is being appended to all my sites page, category, and post URLs. An example of the failed URL looks like this:
http://www.kernelops.com/blog/www.kernelops.com
Google Webmaster Tools indicates that this weird link is originating from the base url "http://www.kernelops.com/blog" which obviously means the issue is on my end. My Wordpress permalink settings are set to use the post-name; I'm not sure if that could be causing this, i.e.:
http://www.kernelops.com/sample-post/
I can't seem to find any help resolving this weird issue with google searches and thought someone here may be able to point me in the right direction.
The Wordpress plugins that would potentially affect the site's URLs are the following:
All in One SEO
XML-Sitemap
But I can't see any sort of setting within these plugins that would be causing this type of issue.
Any ideas would be greatly appreciated - thanks in advance!
This is a long shot, but it may be happening if the Google crawler picks up a link that seems like a relative path and attempts to append it to the current directory. It's highly unlikely that Google would have such a bug, but it's not impossible either.
The closes thing I could find that may be considered a relative path is this:
<div class="copyright">
...
Kernel, Inc.
...
</div>
I doubt that this is the problem, but it may be worth fixing it.
Now, there is yet another possibility and that's if the website serves slightly different content depending on the User Agent string. When Google presents your website with a User Agent string, the SEO plugins detects it and tries to optimize things in order to improve your ranking (not familiar with that plugins, so I don't know what it does exactly). There may be a bug in the SEO plugin that will cause the www.kernelops.com URL to look like a relative path or to actually construct that faulty URL somehow.
You can possibly test this by setting the user-agent string in your browser (e.g. FireFox's user-agent switcher) to Googlebot's user-agent string and test what happens when you visit your website. Look at the page source that you receive and look for any links that might look like the one Google is finding.
However, if the SEO tool is smart enough, it will "realize" that your IP doesn't match one of the valid IPs for Googlebot and it will not make the modifications.

Resources