Page 1 of 1

Trouble Character set Import Wordpress

Posted: Wed Aug 02, 2006 4:06 am
by bigua
Hi all,

I got one trouble there import wordpress database,

for both charset, UTF-8 and ISO8859-1, and it's dont work..

some's character like "á, ã, é, ò" got bug..

My sql system structure:
character set latin1
lang=ptbr-iso-8859-1

And i use db - wp with prefix serendipity_

Thanks .

Re: Trouble Character set Import Wordpress

Posted: Wed Aug 02, 2006 10:33 am
by garvinhicking
Hi!

Which serendipity version are you using? Which charset did you configure in the s9y setup? "Native" or "UTF-8"? And which language are you using? Which PHP version are you using?

Is the WP blog saved in tables that have collations different than the ones of the s9y tables?

Maybe you could upload the SQL dump of your WP tables users, categories, posts, post2cat and comments? Then I could try to do the import on my machine and see if there could be problems? Actually the conversion should all happen properly.

Posted: Wed Aug 02, 2006 1:43 pm
by bigua
Hello, first of all I wanna said full thanks for help!

Here is the things you request:

-

wp_post2cat: http://rafb.net/paste/results/YBEZz326.html
wp_posts: http://rafb.net/paste/results/W6frgW26.html
wp_users: http://rafb.net/paste/results/NXogzG88.html
(i edit/deleted somes things for security and dont come spams)
wp_categories: http://rafb.net/paste/results/4KLmot29.html
wp_comments: http://rafb.net/paste/results/qhG4Z747.html
(i edit/deleted somes comments for its dont come spams)

-

So i tried with Native impoty UTF-8, deleted, later i try set native, and import ISO didnt work, i deleted all, configure it it again UTF-8 and try import with UTF-8, didnt work i deleted all, configure with UTF try import with ISO didnt work,.. so after i try alot types import and configure nothing work :(..

-

I running it in Debian-Sarge
Phpmyadmin 2.6.2,
Mysql 4.0.24,
PHP Version 4.3.10-16
Server API Apache 2.0 Handler
Serendipity-1.0.tar.gz

-
Thanks attention!

Posted: Wed Aug 02, 2006 4:23 pm
by garvinhicking
Hi!

You're welcome :)

What I can tell because of your SQL dumps is that WordPress saves its input in "UTF-8" format. And if you are running your serendipity in UTF-8 format (which you should do!) it should not become a problem.

For the import you should set the Charset in the importer to "UTF-8" as well, and basically it should all work alright.

Using that setting at least my first import tries worked well. If it still does not work for you, could you show us a dump of a serendipity_entries article where the characters are wrong?

Best regards,
Garvin

Posted: Thu Aug 03, 2006 3:46 am
by bigua
Hello

Well, i done with last setting .."UTF-8" and import "UTF-8"


i watch one thing interest:

The title of article works good, but the body of article dont work character...

serendipity_entries: http://rafb.net/paste/results/pAo7qh45.html

the characteres wrong in body,
In title its work fine!


I'm full happy and glad with your help !!

Thanks so much.

Posted: Thu Aug 03, 2006 2:45 pm
by garvinhicking
Hi!

Ah, now I think I know what's going on! It's the dang HTML entity translation. :-)

So, you should hopefully be fine if you just set the "convert html special characters" option to OFF?

Best regards,
Garvin

Posted: Fri Aug 04, 2006 3:23 am
by bigua
Hello, ...

I set it OFF and It's work fine!!!

Very thanks !!!!. (for the all support and respect!)

I will keep some money and put one book this your wish list!!!

Really you deservers the best!

Posted: Fri Aug 04, 2006 11:27 am
by garvinhicking
Hi!

You are welcome! Hope you have fun with Serendipity! :-) :-)

Best regards,
Garvin