Page 1 of 2
German characters again
Posted: Sun Feb 13, 2005 9:41 pm
by holger
Sorry everybody but I am a newbie to s9y and I went through all the postings here but I did not find the answer to my question or did not quite understand what was told.
My problem:
I just installed Serendipity v.0.7.1 and it is running stable so I did not want to make any changes to my installation at that time.
Now I want to import my old blog entries from blogg.de. Everything works fine. Only the German characters like ü,ö,ä,ß and so on were not imported correctly. I have checked the installed character sets. Both are the same (charset=ISO-8859-1).
Is there any patch available to correct this problem or what else can I do to solve it without making too much changes to my running installation?
Thank you in advance for your support.
Kind regards
Holger
Re: German characters again
Posted: Mon Feb 14, 2005 9:26 am
by garvinhicking
What does "were not imported correctly" mean exactly?
In Serendipity 0.8 we have made improvements to the RSS import so that it pays more attention to UTF-8 formatted feeds.
As you do not want to upgrade to 0.8, here's what I propose instead.
1. Edit your bundled-libs/Onyx/RSS.php file
2. Go to line 83 of that file.
3. Find the block with this code in around those lines:
Code: Select all
xml_set_object($this->parser, $this);
xml_parser_set_option($this->parser, XML_OPTION_CASE_FOLDING, false);
xml_parser_set_option($this->parser, XML_OPTION_TARGET_ENCODING, LANG_CHARSET); // INSERT THIS LINE!!!
xml_set_element_handler($this->parser, 'tag_open', 'tag_close');
xml_set_character_data_handler($this->parser, 'cdata');
The line to be inserted has been marked with "// INSERT THIS LINE".
Now the RSS import library should be able to properly import code according to your target encoding (ISO-8859-1)...
HTH,
GArvin
Re: German characters again
Posted: Mon Feb 14, 2005 1:11 pm
by holger
Hello GArvin,
thank you very much for your reply.
Very Happy
To describe the problem more specific here is what happens:
The German charcters ä,ü,ö,ß will be replaced during the import with the sign "?" and the following two sometimes three letters will be deleted totally.
The problem still remains even after following your solution above.
So I think it will be the best to install Serendipity 0.8 although I hoped that I could avoid it.
Can you please help me again by telling me where I may find the newest version in a Windoofs *.zip version?
Thank you in advance.
Kind regards
Holger

Re: German characters again
Posted: Mon Feb 14, 2005 2:48 pm
by garvinhicking
That's strange - the patch you've made to 0.7 is basically what we've done in 0.8, so upgrading won't help.
Can you give me the URL to your RSS feed, then I'll try if I can import it?
Regards,
Garvin
German characters again
Posted: Mon Feb 14, 2005 3:07 pm
by holger
Hello GArvin,
please try
http://pt.blogg.de/rss.xml for importing.
Thanks a lot.
Kind regards
Holger
Re: German characters again
Posted: Mon Feb 14, 2005 6:04 pm
by garvinhicking
I'll look at it first thing tomorrow.
Regards,
Garvin
Re: German characters again
Posted: Tue Feb 15, 2005 12:26 pm
by garvinhicking
The problem was that your feed is in ISO-8859-1 format, which is usually unexpected as all RSS feeds by RFC shall be in UTF-8 encoding.
You can fix this easiest by editing your serendipity_rss_exchange.inc.php file and remove all utf8_decode() function calls.
In current CVS this will be solved in a more convenient way so that you can specify the input charset language of your RSS feed.
Regards,
Garvin
Re: German characters again
Posted: Tue Feb 15, 2005 1:14 pm
by holger
Hi GArvin,
thank you for this hint. I will try it a.s.a.p. and keep you informed about the results.
Kind regards
Holger
Sorry for posting it as a new topic. Will you please be so kind and fix this problem for me? I do not have the rights to delete it by myself. Thanks in advance.
Re: German characters again
Posted: Mon Feb 21, 2005 1:18 pm
by holger
Hello Garvin,
...puhh, this seems to become a never ending story.
I think I overrated my capability. Whatever I deleted in the serendipity_rss_exchange.inc.php file is causing nothing than error messages.
Because I am not familiar with programming I do honestly not know where the utf8_decode() function calls starts nor ends.
May I ask you a last time to give me some help and publish the file as it should look like?
If this will not possible for you I will be patient and wait for version 0.8 to become published.
Sorry for beeing so incompetent. Thank you in advance.
Kind regards
Holger

Re: German characters again
Posted: Mon Feb 21, 2005 1:43 pm
by garvinhicking
Hi Holger!
Na klar, das ist kein Problem. Ich würde Dir die Datei gerne per Mail schicken damit sie hier per copy+paste nicht kaputt geht. Schreib mir mal eine E-Mail an "mail (at} garv.in" - dann schick ich dir die Datei zurück.
Grüße,
Garvin
Re: German characters again
Posted: Thu Feb 24, 2005 3:09 pm
by holger
Hi Garvin,
vielen Dank für das großzügige Angebot.
Leider funktioniert die Mailadresse "mail (at} garv.in" nicht oder ich habe da was falsch interpretiert
Darum hier die meine "holger.ahlers[at]gmail.com".
Nochmals vielen Dank.
Gruß
Holger
Re: German characters again
Posted: Sat Feb 26, 2005 11:50 pm
by garvinhicking
So, ich hab's dir gerade geschickt und ich hoffe es kommt an!
Grüße,
Garvin
Re: German characters again
Posted: Mon Feb 28, 2005 8:58 am
by holger
Hi Garvin,
vielen lieben Dank für all Deine Mühe.
Ich habe es gleich ausprobiert aber leider

immer noch der gleiche Effekt, die Umlaute werden nicht konvertiert.
Ich denke, wir sollten es dabei belassen und ich lege ein wenig Handarbeit an, um die Umlaute anzupassen. Ansonsten warte ich eben auf Version 8.0
Nochmals vielen Dank. Es macht mir Spaß, mit Eurer Software zu arbeiten, zumal bei dem tollen Service, der hier geboten wird.
Gruß
Holger
Posted: Mon Feb 28, 2005 10:12 pm
by Wolfgang
Ich habs auch nicht geschafft, der 0.8er snapshot hat das Problem dann aber sofort beseitigt. Man merkt gar nicht, dass es noch keine endgültige Version ist. Vielleicht gehst Du einfach mal aufs Glatteis mehr als einbrechen kannst Du ja nicht

German charcters again
Posted: Tue Mar 01, 2005 10:26 am
by holger
Hi Wolfgang,
Danke für den Hinweis.
Die Frage ist nur, wo finde ich denn diesen Snapshot? Kannst Du mir da weiterhelfen?
Danke und Gruß
Holger