Projects
- 2Problems
"Some people, when confronted with a problem, think 'I know, I'll use regular expressions.' Now they have two problems." --Jamie Zawinski
2Problems is a GUI utility to process multiple files through a set of regular expressions. Sets of expressions are placed, one per line, in a "rule file", which should be placed in the rules subfolder....
- AccessDBtoMARC
This project is an attempt to automate the extraction of MARC records from a specific MS Access database in use by MSEAG and possibly used by other SIL libraries in Asia. Extracting MARC records is essential to "freeing" the data for use in/by other programs....
- ArtOfReading
This site is for developers.
If you're just looking for Art Of Reading, head over to http://bloom.palaso.org/art-of-reading/
SIL International publishes "International Illustrations: The Art of Reading", as a DVD with a 3rd-part commercial image library program. They have agreed that the prose license on that DVD is consistent with Creative Commons Attribution-No Derivatives license, and thus that we (who also work for SIL, but that's irrelevant) may distribute them with that license. This project consists of batch files which ...
- Bloom
Bloom is a desktop computer software application. It aims to encourage a low-training, high-output system where mother tongue speakers and their advocates work together to foster both community authorship and access to external material in the vernacular....
- Chorus
Chorus is version control system designed to enable workflows appropriate for typical language development teams who are geographically distributed. These teams need to edit a set of common files, working towards a publication. They want to share their work while, crucially, being able to defer dealing with conflicting edits for periods of time or until a qualified team member can make decisions about the conflicts. The system is implemented on top of a commonly used Open Source Distributed Version Control System. It works in scenarios in which users are connected by Local Area Network, Internet, or hand-carried storage devices. Chorus supports several workflow models, including those that maintain a “master” database separate from the incoming submissions of team members. Quite unlike the version control systems commonly in use, the system works invisibly for the common cases and is kept simple for even beginner computer users....
- Chorus for Adapt It
A program to download (clone) and synchronize Adapt It projects (bob_eaton@sall.com)
- Chorus Help Binaries
Documentation binaries for Chorus
- ResumableHg
resumableHg provides a server API (PHP) to transfer Hg bundles in chunks of data that can be retried/retransmitted via http in a way that is tolerant of poor connectivity.
- SampleDictionary
This is a data project, for testing chorus collaboration.
- DdpMerger
A tool to help merge different versions of ddp in various formats.
Currently this project is extremely specific to wesay's ddp file format, but hopefully there will be some useful stuff in it to build on. - DFM build environment
This is an adaptation of the DictionaryForMids (DFM) command line build process. DFM is a Java ME app for feature phones (Nokia mid range phones) and Android when used with DictionaryForMids app from the Play Store. It is of use for people wanting to process multiple language dictionaries. The source needs to be XML. Toolbox XML for MDF or Philippines Dict SFM are handled. Lift XML can generate a word list only dictionary (at the moment). Pathway is adding DFM output so Fieldworks dictionary data can be handled that way. Four outputs sizes are handled for MDF and PLB XML input files. 1. Word list 2. Min size. 3. Mid Size 4. Full sized. This is a Windows command line tool. ...
- FLEx Bridge
FLEx Bridge is an application that allows multiple FieldWorks 8.0 users to do remote collaboration (i.e., not necessarily connected by a local network).
The FLEx Bridge application has merged in the functionality of Lift Bridge, so it now supports Send/Receive for the full FieldWorks data set, as well as smaller set of LIFT dictionary data. FLEx Bridge uses Chorus to do the "heavy lifting" of the collaboration work for both sets of data. Users can collaborate with others using one or both types of data....
- Fonts and Scripts
This is a meta-project containing font or script projects as sub projects.
- font-lao1
OFL Lao body text font for minority use
- GAFAWS
GAFAWS is a .Net (C#) implementation of Joseph E. Grimes' 1983 work entitled Affix Positions and Cooccurrences: The PARADIGM Program. Grimes' original implementation was written in PTP....
- Graphite IDE
Graphite Integrated Development Environment
- graphiteng-dev
the future of graphite
- Graphite Android
Graphite for Android application integration library
- HearThis
HearThis is a prototype of a super-simple recording of Bible texts by native speakers.
- HelpProvider
This is a modest attempt at improving the excellent work done by Wiktor Zychla here and here based on the ideas of Tom Clement over at CodeProject....
- HTML SemDom Gather Words
HTML5 Semantic Domain Gather Words
This HTML5 Offline app allows you to gather words using your tablet or mobile phone and then upload those words to an existing language depot wesay project.
The intention is to package this HTML5 app as an app for both iOS, Android and Windows Phone 8. It may also be published on the web as an offline app.... - IBus
Backports of IBus
- ibus-deb
IBus debian package directory
- ibus-kmfl-deb
builds of ibus-kmfl
- ibus-xkb-layouts-deb
use xkb layouts with ibus
- icu-dotnet
ICU Dot Net provides .Net bindings to the icu library. We also maintain a minimal build of the icu library that essentially provides collation only.
- InKey
autohotkey based keyboard manager
- Jiration
Jiration is a rich client written with .net (c#) which uses a web service to access a Jira server. Using Jiration, you can add issues, comment on them, and resolve them much faster than using even a fast web link.
So far, it is just hacked together for my own use....
- Koha Setup
Notes on installing, configuring, customizing and importing data into Koha, an open source integrated library system (ILS).
-
- LF .Net Client
The language forge client provides a user interface, and RPC mechanism for .Net apps to create new language forge projects, and assign users to the project.
- LF Deploy
A small php script that manages the auto deployment of languageforge.org
- LF Merge
Server side data update and merge tool for LanguageForge.
- LF Shared Components
Shared components for Language Forge. Includes both client side and server side shared components.
- LF Theme
Experimenting with the theme
- LF Web
The Drupal website that is the frame / host for the LanguageForge GWT web applications.
- LFDictionaryView
A simple dictionary web app. Takes a lift file, and shows it as an interactive dictionary.
See the demo here: http://dev2.languageforge.org/DictionaryDemo/App.php
The code is at: http://hg.palaso.org/lflexview
- Lift Bridge
A plugin for FieldWorks Language Explorer which permits sharing of LIFT dictionary data with remote colleagues using FLEx or WeSay.
Status¶
(Last revision of this page: 12 April 2013)
Lift Bridge (Windows) is available for download. This version can only be installed on FLEx versions 7.2.4 -> 7.2.7....
- Lift IO
This project is the home for the development of an XML schema for the storing of lexical information, as used in the creation of dictionaries. It also includes a .net library for working with LIFT (parsing, validating, & migrating).
- LiftTools
Assorted utilities for LIFT files.
Current Utilities:- Validate Lift File. Ensure the file is compliant with the latest version of the LIFT schema.
- Merge Homographs
- Split out each sense containing a CAWL word list number into its own entry
- Remove glosses & definitions written in unwanted writing systems...
- LiftTweaker
LiftTweaker was written to prepare a Papua New Guinea lexicon (koluwawa) for printing. The program prepares "tweak files" which can be applied to a LIFT file to produce subsets of it for different publication targets (e.g. Children's, Categorized by semantic domain, technical, vernacular). It does this by allowing the lexicographer to selectively remove some things from the dictionary for each kind of target publication. This includes removing unnecessary "see also" links to nearby words, and removing words based on parts of speech, so that she can publish proper-names as a separate section of the dictionary....
-
- cli-common
Backport of cli-common - common files for packaging cli apps and libs
- geckofx-deb
Debian packaging for GeckoFx. Needs to be kept uptodate every 6 weeks or so as a new version of geckofx is created for the version of firefox currently available for Ubuntu.
- mono-addins-deb
mono addins backport
- mono-deb
This is the debian-src source for the debian packaging of the Palaso mono builds.
- mono2.0
Palaso build based on Mono2.0
- Mono2.4
Palaso build based on Mono2.4
- xulrunner-deb
backport of xulrunner-1.9.2 for oneiric and precise for chorus app
- MultiScribe
Multiscribe is an add-on for Windows applications that enables text rendering in complex scripts not otherwise supported by Windows. Multiscribe uses the Graphite rendering engine developed by the Non Roman Script Initiative (NRSI) of SIL International.
... - myWorkSafe
Smart & Simple Backup for Language Development Workers
Downloads and more information is available at the project's web site, http://myWorkSafe.palaso.org
Software Developers¶
Source Code¶
To get the source code, you'll need Mercurial. Windows users, install TortoiseHg. Then from a command line, go to where you keep your development projects, and give this command:...
- OpenOffice Linguistic Tools
This add-on provides a menu of tools for linguistic writeups and other documents written in lesser-known languages. It reads XML files from Phonology Assistant, Toolbox, and FieldWorks. Other features include Data Conversion using SIL Converters.
To install, download the oxt file and double-click to open it. The Extension Manager should open and install the add-on. For more detailed installation instructions and version history, see README.txt...
- OurWord
OurWord is Bible Translation software designed for national translators, who are translating the Scriptures into the world's minority languages, and who typically have not had significant computer experience.
Summary
We are coining the term "manual adaptation" to refer to OurWord's process. Software such as Carla and AdaptIt are implementations of "computer adaptation", where the computer replaces words from one language with words in the other language in order to produce an initial draft. With "manual adaptation" the translator still relies on the source language, but rather than receiving word-by-word suggestions from the computer, the translator crafts the translation himself. Thus he is able to address such things as resolving collocational clashes, deciding on appropriate idioms, etc.... - Palaso Library
A set of .Net libraries useful for developers of Language Software. See Main Page for more information.
- Palaso LDML
The Palaso LDML extensions. Provides a specification and rng grammar for the Palaso extensions to the LDML Writing Systems specification.
- Palaso Python Library
Contains palaso developed python modules
- Paratext Service
Web service like library providing consumers with the ability to linguistic services to Paratext.
- PdfDroplet
A small .net app for making rearranging pages of a pdf to make a booklet. Here, we just handle the source code and bug reports. See the website to read about and download the app itself.
- Prophero
Prophero is a program that takes Scripture in various forms, formats it as HTML, and generates a set of related files to create a simple concordance. The resulting set of files does not require any special smarts on a server, so can be directly published to the web or simply copied to a CD or hard drive for local access. See Scripture Publisher Doc.html in the repository for full details. ...
- Publishing
This is a parent project which holds sub projects that are related to publishing systems development.
- Photo Album Maker
Python script for driving Scribus to create a photo album, plus picture processing scripts.
- Rapid Publication Manager
The Rapid Publication Manager (RPM) is a type of publication management engine. It is design to work with various publication project types to process project components.
- SayMore
SayMore is an application for making it easy to organize your Oral Language Documentation metadata, media, and other artifacts.
This site is primarily the home of the code repository; the primary public site is http://saymore.palaso.org.
Here are the sites which make up the SayMore development infrastructure:...
- SayMore-Doc
Documentation for SayMore
- SayMoreOld
SayMore is an application for making it easy to organize your Oral Language Documentation metadata, media, and other artifacts.
This site is primarily the home of the code repository; the primary public site is http://saymore.palaso.org.
Here are the sites which make up the SayMore development infrastructure:...
- Scim
Scim keyboard switching enhancement
- Scim Autoswitcher
Test App for the scim extension
- Scratch NRSI
Temporary repository for learning hg
-
- font-sukhothai
SukhoThai font development project
- Script-fonipa
IPA Script support project. Primarily includes mappings.
- Scripture Checks
A web app to facilitate scripture checks.
- SendIt
A very simple little app which talks to AdaptIt and sends exported SFM files via email.
- sfmDictionary2web
Used to generate web dictionaries from SFM files. It was written to publish SIL Philippines schema dictionaries to web pages. It can also be used with other schema as it is re-configurable. The output is frame based like LexiquePro web output. It differs in that there is only one entry per page. There is also a client side search. An example can be seen here: http://www.sil.org/asia/philippines/online/ivb/dict/...
- SheetSwiper
Sheet Swiper converts spreadsheets containing linguistic data to Standard Format (for use with Linguist's Toolbox, Phonology Assistant, Lexique Pro, etc.). The rules for how to do the conversion come from the first rows of the spreadsheet itself, so subsequent, repeated conversions are easy for non-computer savvy users to do....
- Solid
Solid is a software tool that can be used to check, clean up, and convert Standard Format (e.g. Toolbox) lexicon data.
- tipe
Project home for the TeX Integrated Publishing Environment, yet another TeX publishing tool that has the aspiration of enabling normal humans to use TeX. We'll see how that goes.
- usfmTex
A fork of the ptx2pdf TeX Scripture processing macro package.
- USFMToSwordModule
A front-end to the command-line tools from Crosswire.org for building a Sword Module from USFM files. The front-end is for Windows only, and requires Perl and a minimum of .Net 2.0 to be installed.
Please visit the homepage to see more information including the download link:...
- VimodPub
Various Inputs Mutiple Output Digital Publishing
This is a set of files that can fairly quickly create various publishing outputs from one source. This tool set is aimed at publishers doing digital publishing. The working format is XML. Input formats could be SFM, XML or XHTML or other structured input....
- WeSay
Users¶
You're in the wrong place. Head over to http://wesay.palaso.org.
Testers¶
Please see Tips for Testing Palaso Software
You will also want to follow the WeSay User List...
- WeSay Documentation
Manual and other documentation for WeSay.
- WeSay Localization
WeSay localization
- WSEdit
A CLDR (LDML) editor implemented by wrapping the writing systems UI component from the Palaso Library
Please add any issues to the Palaso Library Issue Tracker...
- WSI AutoTools
Writing System Implementation automated build template system
- wsiwaf
Using waf to build WSIs
- XLingPaper
Writing linguistic documents in XML
Also available in: Atom