Google is not indexing images - xml-sitemap

Hi
Every day my scripts submit an XML sitemap to Google, containing new pages as well as the images associated with them.
The pages are indexed ok but not the images.
For example, if I type in Google Web Search "Camisa Malha Cavalinhos", the first result is https://www.ebraz.co/listing.php?id=1052, which is ok.
But when I turn to Image Search, no results from my site are returned.
Robots.txt is not disallowing the images folder.
Below is the xml (headers [no longer] omitted).
What can I possibly be doing wrong?
Please ask for more details if you need. Thanks.
<?xml version='1.0' encoding='UTF-8'?>
<urlset xmlns='http://www.sitemaps.org/schemas/sitemap/0.9'
xmlns:xhtml='http://www.w3.org/1999/xhtml'
xmlns:image='http://www.google.com/schemas/sitemap-image/1.1'>
<url>
<loc>https://www.ebraz.co/listing.php%3Fid%3D1052%26title%3DCamisa+Malha+Cavalinhos</loc>
<image:image>
<image:loc>https://www.ebraz.co/upload/o_19la610uf1hpi1cuc11n2e801qc915.jpg</image:loc>
<image:title>Camisa Malha Cavalinhos</image:title>
</image:image>
</url>
</urlset>

On the page: https://support.google.com/webmasters/answer/178636?hl=en
You will notice that Google's example begins with the XML declaration line and does not have the xhtml namespace line in it. I would modify your sitemap to be identical to this example from Google. I have previously seen non-identical versions run into issues.
<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
xmlns:image="http://www.google.com/schemas/sitemap-image/1.1">
<url>
<loc>http://example.com/sample.html</loc>
<image:image>
<image:loc>http://example.com/image.jpg</image:loc>
</image:image>
<image:image>
<image:loc>http://example.com/photo.jpg</image:loc>
</image:image>
</url>
</urlset>
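As a rough illustration of generating a sitemap in the same shape as Google's example, here is a minimal Python sketch using the standard library's xml.etree.ElementTree. The function name build_image_sitemap and the example.com URLs are hypothetical and not taken from the question. The sketch writes each <loc> as the plain page URL and leaves XML escaping to the serializer; that is an assumption about what the submitting script could do, not a confirmed fix for the indexing problem.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
IMAGE_NS = "http://www.google.com/schemas/sitemap-image/1.1"


def build_image_sitemap(pages):
    """Build an image sitemap (hypothetical helper).

    pages is an iterable of (page_url, [image_url, ...]) pairs.
    Returns the serialized sitemap as UTF-8 bytes.
    """
    # Serialize the sitemap namespace as the default namespace and the
    # image extension under the "image" prefix, as in Google's example.
    ET.register_namespace("", SITEMAP_NS)
    ET.register_namespace("image", IMAGE_NS)

    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    for page_url, image_urls in pages:
        url = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
        # The serializer escapes "&" as "&amp;" when writing the text.
        ET.SubElement(url, f"{{{SITEMAP_NS}}}loc").text = page_url
        for image_url in image_urls:
            image = ET.SubElement(url, f"{{{IMAGE_NS}}}image")
            ET.SubElement(image, f"{{{IMAGE_NS}}}loc").text = image_url

    # xml_declaration=True puts the <?xml ...?> line at the very top.
    return ET.tostring(urlset, encoding="UTF-8", xml_declaration=True)


if __name__ == "__main__":
    sitemap = build_image_sitemap([
        ("https://example.com/listing.php?id=1&title=Sample",
         ["https://example.com/upload/sample.jpg"]),
    ])
    print(sitemap.decode("utf-8"))
```

Building the document with an XML library instead of string concatenation keeps the declaration on the first line and avoids hand-escaping mistakes in query strings.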

Related

JavaFX Program isn't working when trying to compile with FXML-File [duplicate]

This error,
The processing instruction target matching "[xX][mM][lL]" is not allowed
occurs whenever I run an XSLT page that begins as follows:
<?xml version="1.0" encoding="windows-1256"?>
<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
<xsl:include href="../header.xsl"/>
<xsl:template match="/">
<xsl:call-template name="pstyle"/>
<xsl:call-template name="Validation"/>
<xsl:variable name="strLang">
<xsl:value-of select="//lang"/>
</xsl:variable>
<!-- ////////////// Page Title ///////////// -->
<title>
<xsl:value-of select="//ListStudentFinishedExam.Title"/>
</title>
Note: I removed any leading spaces before the first line, but the error still occurs!
Xerces-based tools will emit the following error
The processing instruction target matching "[xX][mM][lL]" is not allowed.
when an XML declaration is encountered anywhere other than at the top of an XML file.
This is a valid diagnostic message; other XML parsers should issue a similar error message in this situation.
To correct the problem, check the following possibilities:
* Some blank space or other visible content exists before the <?xml ?> declaration. Resolution: remove blank space or any other visible content before the XML declaration.
* Some invisible content exists before the <?xml ?> declaration. Most commonly this is a Byte Order Mark (BOM). Resolution: remove the BOM using techniques such as those suggested by the W3C page on the BOM in HTML.
* A stray <?xml ?> declaration exists within the XML content. This can happen when XML files are combined programmatically or via cut-and-paste. There can only be one <?xml ?> declaration in an XML file, and it can only be at the top. Resolution: search for <?xml in a case-insensitive manner, and remove all but the top XML declaration from the file.
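The three checks above can be sketched as a small Python script. This is only a heuristic, and the function name diagnose_xml_declaration is hypothetical: it looks at raw bytes, handles only the UTF-8 BOM, and will also flag a "<?xml " sequence that appears inside a comment or CDATA section.

```python
import re

UTF8_BOM = b"\xef\xbb\xbf"
# "<?xml" followed by whitespace, case-insensitive. The whitespace
# requirement avoids matching "<?xml-stylesheet ...?>".
XML_DECL = re.compile(rb"<\?xml\s", re.IGNORECASE)


def diagnose_xml_declaration(data: bytes) -> list:
    """Return a list of likely causes of the
    'processing instruction target matching "[xX][mM][lL]"' error."""
    problems = []

    # Invisible content: a UTF-8 byte order mark at the start.
    if data.startswith(UTF8_BOM):
        problems.append("UTF-8 byte order mark before the XML declaration")
        data = data[len(UTF8_BOM):]

    # Blank space or visible content before the first declaration.
    first = XML_DECL.search(data)
    if first is not None and first.start() > 0:
        prefix = data[:first.start()]
        if prefix.strip() == b"":
            problems.append("whitespace before the XML declaration")
        else:
            problems.append("visible content before the XML declaration")

    # More than one declaration anywhere in the file.
    count = len(XML_DECL.findall(data))
    if count > 1:
        problems.append(f"{count - 1} stray XML declaration(s) after the first")

    return problems


if __name__ == "__main__":
    sample = b'\n<?xml version="1.0"?><root/>'
    for problem in diagnose_xml_declaration(sample):
        print(problem)
```

To check a real file, read it in binary mode (open(path, "rb").read()) so that a BOM or leading whitespace is not silently dropped by text decoding.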
Debug your XML file: either there is stray whitespace, or there are extra or missing tags.
To get a clearer error message, build the project from the command line. On Windows: gradlew build
In my case, AndroidManifest.xml had a blank line as its very first line:
<Empty Row> // This Creates the issue
<?xml version="1.0" encoding="utf-8"?>
There was an auto-generated copyright message in the XML and a blank line before the <resources> tag. Once I removed them, my build was successful.
Just remove this line: <?xml version="1.0" encoding="utf-8"?>. This kind of error only occurs because of this line. Alternatively, check that your line is formatted exactly like the one shown in this answer.
I had a similar issue with 50,000 RDF/XML files in 5,000 directories (the Project Gutenberg catalog file). I solved it with riot (in the Jena distribution).
The files are at cache/epub/NN/nn.rdf (where NN is a number).
Run the following from the directory above the one where all the files are, i.e. in cache:
riot epub/*/*.rdf --output=turtle > allTurtle.ttl
This may produce many warnings, but the result is in a format that can be loaded into Jena (using the Fuseki web interface).
Surprisingly simple (at least in this case).
Another cause of the above error is a corrupted JAR file. I got the same error, but for JUnit, when running unit tests. Removing the JAR and downloading it again fixed the issue.
In my case it was a wrong path in a config file: the file was not found (the path was wrong), and it produced this exception:
Error configuring from input stream. Initial cause was The processing
instruction target matching "[xX][mM][lL]" is not allowed.
For PHP, put this line of code before you start printing your XML:
while(ob_get_level()) ob_end_clean();
It's worth checking your server's folders to see if there's a stray pom.xml hanging around.
I found that I had the problem everyone else described with a malformed pom.xml, but in a folder that I didn't expect to be on the server. An old build was sticking around unwelcome D:
In my case, a tab character was the troublemaker. Replacing the tab with spaces should resolve the issue.

RSS '<link>' Tag in Jade

For my blog site, I use Node.js, Express, and Jade for templating.
I built a simple rss.jade template for my RSS feed, and am having problems getting the required <link> tag to work nicely.
I believe, in older versions of Jade you could use:
link http://example.com
Now, it is enforcing that 'link' is self-closing like:
link(rel='stylesheet', href='style.css')
Is there any solution other than plain text, like:
|<link>http://example.com</link>
Are you sure you defined the right doctype?
With the doctype xml you can produce this example:
doctype xml
rss(version="2.0")
  link.
    http://example.com
and get the output you wished to see:
<?xml version="1.0" encoding="utf-8" ?>
<rss version="2.0">
<link>http://example.com</link>
</rss>

How do I use XML and CSS together to make a "fancy" SiteMap?

I have created a sitemap that conforms to the Sitemap protocol. It is in XML format. How would I apply CSS to create a fancy sitemap similar to Fraunhofer's?
The XML was made by an online sitemap generator and has "url" tags. Each tag represents a page, but nothing is inside that tag except info about that page (no url tags nested within url tags to show a tree-like setup), so how would I even show which links are level 1, 2, or 3? Also, since this format only has one main tag called "url", I can't style different pages differently.
EDIT: This is what some of my generated XML looks like:
<?xml version="1.0" encoding="UTF-8"?>
<urlset
xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://www.sitemaps.org/schemas/sitemap/0.9
http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd"
>
<url>
<loc>http://buytadiran.com/</loc>
<lastmod>2014-03-04T15:21:12-06:00</lastmod>
</url>
<url>
<loc>http://buytadiran.com/index.php?option=com_content&view=article&id=47&Itemid=54</loc>
<lastmod>2014-03-04T15:21:12-06:00</lastmod>
</url>
</urlset>
I'd like to have a tree-like setup similar to what I linked, without the folds, where a top-level page is shown in green or something, and subpages are shown indented under it with different coloring...
Is it even possible to make a sitemap that is structured this way and still conforms to the Sitemap protocol, or should I be looking at making a separate, "user friendly" sitemap that I'm not submitting to Google?
Thanks!

How can I keep XHTML markup in the RSS output from Yahoo Pipes

I am using Yahoo Pipes to aggregate a variety of RSS feeds (and make some simple transformations, etc.). However, the XHTML content of my feeds is stripped away by the aggregation process. The problem seems to have nothing to do with my pipes code, and to simply be a result of how Yahoo Pipes transforms the input RSS I am feeding it into the output RSS it spits out. Though both are apparently RSS, they are quite different XML files.
So, for example, when the "input" rss looks like this:
<?xml version="1.0" encoding="UTF-8"?>
<feed>...
<entry>
...
<title>...</title>
<content type="xhtml">
HTML CONTENT HERE
</content>
</entry>
...
</feed>
The output looks like this:
<rss xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:media="http://search.yahoo.com/mrss/" xmlns:yt="http://gdata.youtube.com/schemas/2007" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0">
<channel>
<title></title>
<description></description>
<link>
</link>
...
<item>
<title>...</title>
...
<description>
NON HTML CONTENT HERE
</description>
...
</item>
</channel>
</rss>
It seems that the problem is that the contents of the <content> tag in my input are being converted into a <description> tag in the yahoo rss output, and from what I looked up, the <description> tag does not support markup within it, so it makes sense that that content is plain text.
My problem is that I can't find any documentation of why Yahoo is making this transformation or how to avoid it. It is nothing that I can find in my pipe, but on the other hand, I can't find a way to access an un-transformed version of my pipe's output. It seems I must be missing something because obviously people use embedded HTML in RSS all the time and I can't imagine it's all being squashed in every pipe out there.
Here is the link to a minimal test case in which I pull in a single feed and output it again with Yahoo pipes: http://pipes.yahoo.com/pipes/pipe.info?_id=5302fba3cc2d02f7a87ffdded87ce627
Can anyone tell me how to modify the above test pipe so that it will spit out content with html in it? Alternatively, can anyone explain if/why this is impossible to do?
AFAIK you can't get raw feed output as XML: Yahoo! shoehorns your output into specific standard RSS fields, as you noticed. The best option that I can think of is to get your output as JSON (a JavaScript object), as the JSON preserves all the fields in the input:
http://pipes.yahoo.com/pipes/pipe.run?_id=5302fba3cc2d02f7a87ffdded87ce627&_render=json
Most modern processors will take JSON output in addition to RSS.

I have question about RSS feed

I am stuck on this. My project is as follows: "This week you will be creating a theme for a web development company."
* You can use http://www.1kbgrid.com/ for a base design.
* Come up with a color scheme that will match this logo.
* Decide what your top links should be. Create an RSS feed for these top pages.
* Create a report that states the scope of the project (why you are doing it), the top level pages, why you picked the color scheme,etc.
OK, I have made an RSS feed, and this is what it looks like:
<?xml version="1.0" encoding="iso-8859-1"?>
<rss version="2.0">
<channel>
<title>Web design software</title>
<link></link>
<description>Has different soft ware and prices.</description>
<item>
<title>Adobe Dreamweaver</title>
<link>http://shopping.yahoo.com/search?p=web%20design</link>
<description>This tells you how much it cost and what it is
for.</description>
</item>
<item>
<title>Frontpage 2003 upgrade</title>
<link>
http://www.softwaremedia.com/microsoft/frontpage/frontpage-
2003-upgrade.html?ovchn=PGR&ovcrn=39202323&ovtac=CMP&ovcpn=frontpage-2003---upgrade</link>
<description>THis is to help build a better web
site.</description>
</item>
<item>
<title>WebPlus X4 Website Maker Mini Box</title>
<link>
http://accessories.us.dell.com/sna/productdetail.aspx?sku=A3310055&cs=04&c=us&l=en&dgc=SS&cid=52102&lid=1342490</link>
<description>It helps you to make websites</description>
</item>
</channel>
</rss>
What I am stuck on is how to decide what my top links should be, and then creating an RSS feed for those top pages. The question is: am I supposed to link to the RSS feed, and is that even possible? I am totally lost on this question. Thank you for reading this.
You generally link to RSS documents from your main site by using the <link> tag. It should look something like this
<link rel="alternate" type="application/rss+xml" title="Top Web design software Feed" href="http://LINKTOMYRSSFILE" />
If your site is dynamic, you'll need to replace LINKTOMYRSSFILE with the link to the file that generates your RSS. If it's a static file, just save the file on your server and point the link to it.
Also, your main channel link should point to the link of the page that displays all this content.
EDIT
First off, you should use the W3C service to validate your feed. It points out errors and deviations from best practices, and tells you how to fix them (use the help link next to each error).
Make sure the empty <link></link> tag contains a url to your site
To answer your questions in the comments
Missing atom:link with rel="self"
According to the RSS Advisory Board's
Best Practices Profile, identifying a
feed's URL within the feed makes it
more portable, self-contained, and
easier to cache. For these reasons, a
feed should contain an atom:link used
for this purpose.
via : http://validator.w3.org/feed/docs/warning/MissingAtomSelfLink.html
You can fix this by changing your <rss> tag to
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
And adding the following above your items
<atom:link href="http://LINKTOFEED" rel="self" type="application/rss+xml"/>
Line 12, column 2: item should contain a guid element
It's recommended that you provide the guid, and if possible make it a permalink. This enables aggregators to not repeat items, even if there have been editing changes.
A frequently asked question about <guid>s is how do they compare to <link>s. Aren't they the same thing? Yes, in some content systems, and no in others. In some systems, <link> is a permalink to a weblog item. However, in other systems, each <item> is a synopsis of a longer article, <link> points to the article, and <guid> is the permalink to the weblog entry.
So in your case adding the following to each of your items will fix it
<guid isPermaLink="false">LINKTOSOFTWARE</guid>
Replace LINKTOSOFTWARE with the links in your <link></link> tags.
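For a feed with many items, the two fixes above (the atom:link with rel="self" and a guid per item) could be applied with a short script instead of by hand. The following is a minimal Python sketch under stated assumptions: the function name add_self_link_and_guids and the example.com URLs are hypothetical, and it copies each item's <link> text into a guid with isPermaLink="false", as suggested above.

```python
import xml.etree.ElementTree as ET

ATOM_NS = "http://www.w3.org/2005/Atom"


def add_self_link_and_guids(rss_xml: str, feed_url: str) -> str:
    """Add an atom:link rel="self" and per-item guid elements
    to an RSS 2.0 document (hypothetical helper)."""
    # Serialize the Atom namespace under the "atom" prefix.
    ET.register_namespace("atom", ATOM_NS)
    rss = ET.fromstring(rss_xml)
    channel = rss.find("channel")

    # Insert the self link just above the first <item>.
    self_link = ET.Element(
        f"{{{ATOM_NS}}}link",
        {"href": feed_url, "rel": "self", "type": "application/rss+xml"},
    )
    children = list(channel)
    first_item = next(
        (i for i, child in enumerate(children) if child.tag == "item"),
        len(children),
    )
    channel.insert(first_item, self_link)

    # Give every item without a guid one based on its <link> text.
    for item in channel.findall("item"):
        link = item.findtext("link", default="").strip()
        if item.find("guid") is None and link:
            guid = ET.SubElement(item, "guid", {"isPermaLink": "false"})
            guid.text = link

    return ET.tostring(rss, encoding="unicode")


if __name__ == "__main__":
    feed = (
        '<rss version="2.0"><channel><title>Demo</title>'
        "<link>http://example.com/</link>"
        "<description>Demo feed</description>"
        "<item><title>A</title><link>http://example.com/a</link></item>"
        "</channel></rss>"
    )
    print(add_self_link_and_guids(feed, "http://example.com/feed.xml"))
```

The serializer adds the xmlns:atom declaration to the <rss> tag automatically, which corresponds to the manual change to the <rss> tag described above. Running the result through the W3C feed validator again is still worthwhile.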
I corrected your mistakes and your feed validates fine. You just need to fix these errors.
You can follow this tutorial on how to add RSS feeds to your pages : https://developer.mozilla.org/en/RSS/Getting_Started/Syndicating
It's as simple as adding an HTML tag into your page.

Resources