Page 1 of 1

Plugin request import html files

Posted: Tue Oct 11, 2005 11:19 pm
by Guest
For about eight months I've been writing a blog on my site. It was simply html text tossed onto separate pages on my site. Now that I'm up and running on Serendipity Blog, I want to import all those pages into my Serendipity Blog??

Easy? Hard? Heinous? A breeze?
Let me know.
Thanks,
Jeff

Posted: Wed Oct 12, 2005 2:17 am
by Col. Kurtz
The reason that keeps me from using s9y from my own diary is something similar. my diary´s adress is http://boogada2.diaryland.com and I have no idea how to get those old entries to s9y without spending 2 days of copy and paste. any suggestions?


p.s. dont mind the design, it looks better with the surounding (full) website..

Posted: Wed Oct 12, 2005 11:52 am
by garvinhicking
The problem is of course, that HTML-only pages are not parsable. They have no common formatting on how Serendipity can now, this is the date, this is the teaser, this is the body, these are the comments, this is the author.

Thus, parsing HTML generically is IMPOSSIBLE. You need to write your own custom importer for that which does Preg-Matching.

Anyone with PHP knowledge can use our include/admin/importers/* importer classes as a foundation and put some regex matching into these importers.

Or turn your HTML into an RSS XML file via XSLT, which can be applied to XHTML pages as well.

Of course this is very advanced. I can offer to write an importer for HTML pages, but I cannot do that for free, because it would only help a single person and not any Serendipity user.

Best regards,
garvin