Discussion:
[scribus] ODT into Scribus
John Hodge
2018-11-08 13:35:26 UTC
Hi all.
I have ODT and DOCX files produced by an OCR program that has tried to reproduce the layout of the magazine-type pages originally scanned, including text frames, fonts, formatting, etc.
I want to import these into Scribus, keeping the fonts, font sizes and other typeface formatting, but also, importantly, the text frames.

You can edit and manipulate these text frames and their formatting in LibreOffice Writer, but it's very awkward, nowhere near as good as Scribus for this task.
Can Scribus import ODT files while keeping this formatting and layout in editable form? If not, could this be a feature request?
Thanks, JH
___
Scribus Mailing List: ***@lists.scribus.net
Edit your options or unsubscribe:
http://lists.scribus.net/mailman/listinfo/scribus
See also:
http://wiki.scribus.net
http://
Peter Nermander
2018-11-08 13:42:11 UTC
Post by John Hodge
Can Scribus import odts maintaining this formatting and layout in editable
form? If not, could this be a request?
As far as I know: no, it can't, and yes, you can request it, but it is very
unlikely that anyone will be willing to spend the time needed to code this.

Scribus can to some extent import ODT into a text frame, but that's about
it.

/Peter
Gregory Pittman
2018-11-08 14:14:58 UTC
Post by Peter Nermander
Post by John Hodge
Can Scribus import odts maintaining this formatting and layout in editable
form? If not, could this be a request?
From what I know: No, and yes, but it is very unlikely that someone is
willing to spend the time needed to code this.
Scribus can to some extent import ODT into a text frame, but that's about
it.
ODT isn't really a frames environment. The main thing you get from bringing an ODT document into Scribus is that styles are preserved and created in Scribus for further use and editing. You will still have to create some sort of layout with your Scribus frames.

Greg


Rolf-Werner Eilert
2018-11-08 14:42:08 UTC
Post by John Hodge
Can Scribus import odts maintaining this formatting and layout in editable form? If not, could this be a request?
Just had a look into an odt document. Its text/layout part consists
mainly of an .xml file. This contains simple text elements like
paragraph styles, linefeed commands + text which would have to be
interpreted serially. Then there are some elements like svg commands for
lines etc. and commands for boxes.

All in all, I would say "everything is there" to rebuild the page from
such an xml (there is also a styles.xml file that contains basic page and
paragraph definitions for the whole document).

You would have to code an interpreter that can rebuild the page from
those definitions. No voodoo, but I guess even with help from the
LibreOffice guys, that's a long way, mate ;)
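To give a rough idea of what the first step of such an interpreter might look like, here is a minimal Python sketch. It assumes the OCR output stores each text frame as a draw:frame element with a draw:text-box child inside content.xml, which is how the OpenDocument format normally represents text frames; real OCR output may differ. The function name list_text_frames is made up for this illustration, and the sketch only reads frame geometry and plain text. It does not create anything in Scribus.

```python
import zipfile
import xml.etree.ElementTree as ET

# Namespace URIs from the OpenDocument specification.
NS = {
    "draw": "urn:oasis:names:tc:opendocument:xmlns:drawing:1.0",
    "svg": "urn:oasis:names:tc:opendocument:xmlns:svg-compatible:1.0",
    "text": "urn:oasis:names:tc:opendocument:xmlns:text:1.0",
}


def list_text_frames(odt_file):
    """Return position, size and plain text of each text-box frame.

    odt_file may be a file path or a binary file-like object.
    (Hypothetical helper, not part of Scribus or LibreOffice.)
    """
    # An ODT file is a zip package; the page content lives in content.xml.
    with zipfile.ZipFile(odt_file) as package:
        root = ET.fromstring(package.read("content.xml"))

    frames = []
    for frame in root.iter("{%s}frame" % NS["draw"]):
        box = frame.find("draw:text-box", NS)
        if box is None:
            continue  # image or object frame, not a text frame
        paragraphs = [
            "".join(p.itertext()) for p in box.iter("{%s}p" % NS["text"])
        ]
        frames.append({
            # Lengths are kept as the strings stored in the file, e.g. "2.5cm".
            "x": frame.get("{%s}x" % NS["svg"]),
            "y": frame.get("{%s}y" % NS["svg"]),
            "width": frame.get("{%s}width" % NS["svg"]),
            "height": frame.get("{%s}height" % NS["svg"]),
            "text": "\n".join(paragraphs),
        })
    return frames
```

Calling list_text_frames("scan.odt") (the file name is only a placeholder) would return one dictionary per text frame. Mapping fonts, styles and page geometry onto Scribus frames is the much larger part of the job and is not attempted here.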

As a workaround, if the pages you get out of the OCR software are ok,
maybe just let LibreOffice make a PDF from it and include that into your
Scribus document. What do you think?
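
If there are many pages, the PDF export could be scripted instead of done by hand in Writer. The following is a minimal sketch, assuming LibreOffice is installed and its soffice executable is on the PATH; the helper names build_export_command and export_pdf are invented for this example.

```python
import subprocess
from pathlib import Path


def build_export_command(odt_path, out_dir, soffice="soffice"):
    """Build the LibreOffice command line for a headless PDF export."""
    return [
        soffice,
        "--headless",            # run without opening the GUI
        "--convert-to", "pdf",   # target format
        "--outdir", str(out_dir),
        str(odt_path),
    ]


def export_pdf(odt_path, out_dir, soffice="soffice"):
    """Run the export and return the path where the PDF is expected.

    Assumes soffice is installed; raises CalledProcessError on failure.
    """
    subprocess.run(build_export_command(odt_path, out_dir, soffice), check=True)
    return Path(out_dir) / (Path(odt_path).stem + ".pdf")
```

For example, export_pdf("page01.odt", "pdf_out") (both names are placeholders) should leave pdf_out/page01.pdf ready to be placed in an image frame in Scribus.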

Regards
Rolf

Gregory Pittman
2018-11-08 15:04:46 UTC
Post by Rolf-Werner Eilert
As a workaround, if the pages you get out of the OCR software are ok, maybe just let LibreOffice make a PDF from it and include that into your Scribus document. What do you think?
The problem with importing a PDF is that the text glyphs are vector graphics, so rather difficult to edit. I have done this, and even altered text somewhat, but matching the size of new text with these glyphs is tricky.

Greg


Rolf-Werner Eilert
2018-11-08 15:18:29 UTC
Post by Gregory Pittman
The problem with importing a PDF is that the text glyphs are vector graphics, so rather difficult to edit. I have done this, and even altered text somewhat, but matching the size of new text with these glyphs is tricky.
That is true. So I would make sure the page is "complete" before
exporting and importing. You could use all the decent tools Writer offers,
like spellchecking, correcting, positioning etc. and only then export
the PDF.

If you do a text-only import into a text frame in Scribus, you will
have to check its contents as well.

Regards
Rolf

