Monday, August 12, 2013

Google Spreadsheet ImportHtml and ImportXML for Chinese Characters

Question:

( by KennethY )



When I tried to import some information using "importxml" from a Web page with Korean language to spreadsheet, it displayed as "????????"
does this function not support the Asian language? if it's the case, anyone could kindly share some idea how I could do the similar things ? Many Thanks!!


Solution:

May be you have not structured your XPath query properly...

Google Spreadsheet's functions ImportXML and ImportHTML does supports Asian language, you can see the following example.
Have a look at the following screenshot of Wikipedia's web page:




Have a look at the following screenshot of Spreadsheet:


I have the following formula in Cell A2:
=ImportHtml("http://en.wikipedia.org/wiki/Chinese_character";"table";0)


Have a look at the following screenshot of Spreadsheet:


I have the following formula in Cell A2:
=ImportXml("http://en.wikipedia.org/wiki/Chinese_character";"//table[@class='wikitable']//span[@lang='zh-tw']")


I hope the above solution will help you, and if you need more help then please do comment below on this blog itself, I will try to help you out.


I also take up private and confidential projects:
If this blog post was helpful to you, and if you think you want to help me too and make my this blog survive then please donate here: http://igoogledrive.blogspot.com/2012/09/donate.html 

Thanks,

2 comments:

  1. When I take data from tatoeba.org, it just becomes ???????. Do you have any way of solving this problem?

    ReplyDelete
    Replies
    1. What exactly you want to fetch from tatoeba.org ?

      Delete