Skip to content
This repository has been archived by the owner on Jul 25, 2023. It is now read-only.

spiffytech/bible_fixer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

38 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Bible Fixer

The YouVersion Bible texts include import errors where some words don'thave spaces betweenthem like they should. This program finds all such cojoined words, identifies the two correct words, and outputs a CSV of string replacements that is easy to machine-parse, in the hopes YouVersion will take my program output and fix this so it stops bugging me :p

There are two parts to this- the downoader script (written in Node.js because I thought Node was neat then, and didn't want to rewrite it), and the program that parses all the downloaded Bible texts and does the real work, written in Go.

I made the processing program as accurate as I could, but some real words cannot be distinguished from cojoined words programatically. I tried to scrub these manually, but didn't check all 5,800 replacements :p

This is designed to be used with the God's Word translation. Other translations include HTML elements (footnotes, etc.) that I didn't bother to filter out because GWT doesn't have them, so you'll wind up with bogus things smattered around the verse text this program tries to process. Should be an easy thing to resolve if you cared.

Install

npm install should take care of the downloader.

For the golang portion, which processes the text and finds typos:

go get github.com/bmizerany/pq
go get github.com/coopernurse/gorp
go get golang.org/x/net/html
go get github.com/PuerkitoBio/goquery
go get github.com/pmylund/go-cache
go build

About

Identifies words in the YouVersion Bible texts that are accidentally joined, rather than having a space between them, and offers corrections

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published