Text Processing

OOo Label Templates 1.0

Free Opendocument (ODF) label templates are available for downloading from Worldlabel.com. Set-up time is quick and designing and printing labels from these templates is easy. The templates include CD, mailing, vhs tape, sizes for book plates and more. The templates will work on Open Source text editors like Openoffice.org Writer and Kword. US letter and European/Asian a4 sizes available.

Please visit: Label Templates to view the complete collection.

iVia in DLib

Don writes: "iVia - the software that runs the INFOMINE gateway - is described in an article from D-Lib Magazine, Jan. 2003 - "iVia Open Source Virtual Library System". Wow!"


from fm: "ParaTools is a set of Perl modules for the handling of document references. It includes two citation parsers, a document parser, OpenURL support, Web service examples, and detailed documentation. The toolkit is available as open source, and has been designed to be easily expandable. The parsing functionality in ParaTools is already in use in the ParaCite system."


from freshmeat: "This version ported g3data to GTK+ 2.2. All deprecated functions were removed."


from freshmeat: "This release uses gdk-pixbuf instead of Imlib for image manipulation. The ability to scale and image with command line parameters has been added." Find it at the g3data site.


This came out about a month ago, must have missed it. From the g3data site: "g3data is used for extracting data from graphs. In publications graphs often are included, but the actual data is missing. g3data makes the extracting process much easier." Very, very slick, and while it's a niche application there will likely be a moment each of you will need this upon encountering a frustrated researcher on deadline at the reference desk. Reminds me of registering datapoints on old maps using GIS.



more meat, freshlike: "Concordance is a simple concordancing tool for the Linux (and possibly other Unices) console, with regexp capabilities. It scans a text file and outputs concordance lines based on a node entered by the user."


from freshmeat: "g3data is a program for extracting data from graphs (i.e., scanned graphs from scientific publications). It can read many different image formats and outputs the extracted data through stdout." Way cool; any sense of how many researchers are doing this kind of data reanimation?



as seen at freshmeat: "Etext string searches are now working. Added CREDITS, MANUAL files. Cleaned up the Makefile. Refined the keybindings. Init method looks for .gutenbook and Gutenberg_Library directories; makes them if they don't exist. This in preparation for implementing user preferences UI and saving etexts locally." all this and more available at the Gutenbook site.



as seen at the new gutenbook.org site: "A whole heap of enhancements and features... Implemented page searching; Implemented user preferences and preference interface (note that cached index auto-update is currently disabled); Bound key "p" to preferences interface, "Ctrl-p" to page search; Implemented local index caching; Cached index represents local etext presence; Implemented etext auto-save feature; Implemented etext read-local feature." Most definitely appreciate the shouts, too. :)

Syndicate content