[bksvol-discuss] imges to text

  • From: Cindy <popularplace@xxxxxxxxx>
  • To: bksvol-discuss@xxxxxxxxxxxxx
  • Date: Tue, 28 Dec 2004 23:41:44 -0800 (PST)

Thanks, Shelley.  I don't think I have all those
options. I tried using Scan to File to scan a page of
text, and I was able to save it, but how do I convert
to text to edit? I tried opening with Word but just
got a lot of symbols. Maybe my 1660 isn't
sophisticated enough. The only thing "recognize" is
part of is the OCR.

Well, since I'm not planning to do a lot of scanning,
I guess I don't need to know -- but I do like to learn
these things. I appreciate your and Guido's efforts to
educate me (smile).

Cindy


> under Find Reader.
> 
> You would pick "Scan"
> 
> And then scan the images.
> 
> Then when you are done, pick
> 
> "read all"
> 
> Which of course reads the images and determines
> what's on them.
> 
> Then "send pages to" under file.
> 
> That is what I know.
> 
> Shelley L. Rhodes and Judson, guiding golden
> juddysbuddy@xxxxxxxxxxxx
> Guide Dogs For the Blind Inc.
> Graduate Advisory Council
> www.guidedogs.com
> 
> The vision must be followed by the venture. It is
> not enough to
> stare up the steps - we must step up the stairs.
> 
>       -- Vance Havner
> ----- Original Message ----- 
> From: "Cindy" <popularplace@xxxxxxxxx>
> To: <bksvol-discuss@xxxxxxxxxxxxx>
> Sent: Tuesday, December 28, 2004 10:40 PM
> Subject: [bksvol-discuss] Re: txt page breaks redux
> 
> 
> Guido,
> 
> That's very interesting. I, too, have the 1660, but
> of
> course I don't have Kurzweil. I think it's Fine
> Reader
> that I ascertained it came with, but I'm not sure.
> 
> If I used Image instead of OCR, how would I convert
> the images to OCR, or can I not without Kurzweill? I
> thought I tried once when I scanned with Image on by
> mistake, but I may be wrong.
> 
> The computer and scanner are for me like a car -- I
> can drive it but I understand little about how it
> works, except for all the information you and others
> here share. (shamefaced smile)
> 
> Cindy
> 
> --- Guido Corona <guidoc@xxxxxxxxxx> wrote:
> 
> > Cindy,  I am using a lowly EPSON 1660 with the
> > Kurzweil 9.02 software.  As
> > I scan at 400 DPI in grayscale,  the OCR engine
> > would take a lot longer to
> > recognize if I scanned and recognized pages
> > simultaneously.  So I scan
> > images only.  Then I submit the entire stack of
> > images for recognition at
> > the end  Recognition will take anything from 20
> > minutes to 2 or even three
> > hours,  depnding on the length of the book and the
> > degree of difficulty
> > encountered in the reco process.  No skin off my
> > back though,  I can do a
> > lot of other things during that time:  my
> attention
> > is not needed.
> >
> > As I recall,  books with errors amounting to or
> less
> > than 0.7% are deemed
> > excellent.
> > Between 0.7% and 1.5% are 'good'.  Submission with
> > more than 1.5% error
> > rate are deemed fair.
> >
> > But the reviewer has the opportunity to exercise
> > judgment and override the system evaluation, up or
> > down.  If a book seems
> > to have holes with frequent missing or corrupted
> > words,  I nuke it,  no
> > matter what the system evaluates, and add a
> comment
> > for the administrator.
> > If the book has a bunch of 'the' misspelled as
> 'die'
> > I try to fix them one
> > at a time,  unless I find that to be a thankless
> > task when that turns out
> > to be just one aspect of a much bigger problem.
> > So Cindy,  I think you know my opinion by now.
> >
> > Guido
> >
> > Guido Dante Corona
> > IBM Accessibility Center,  Austin Tx.
> > Research Division,
> > Phone:  512. 838. 9735.
> > Email: guidoc@xxxxxxxxxxx
> > Web:  http://www.ibm.com/able
> >
> >
> >
> >
> > Cindy <popularplace@xxxxxxxxx>
> > Sent by: bksvol-discuss-bounce@xxxxxxxxxxxxx
> > 12/28/2004 06:24 PM
> > Please respond to
> > bksvol-discuss
> >
> >
> > To
> > bksvol-discuss@xxxxxxxxxxxxx
> > cc
> >
> > Subject
> > [bksvol-discuss] Re: txt page breaks redux
> >
> >
> >
> >
> >
> >
> > Guido,
> >
> > You must have wonderful scanner!!!! There's no way
> I
> > can scan a book that quickly, since I have to scan
> a
> > page at a time, and can convert 8 - 12 pages at a
> > time.
> >
> > Anyway, since so many other people are happier
> > scanning, I'm leaving that to them -- but unless
> > everybody submits in rtf format, that still leaves
> > the
> > problem of hard page breaks in txt books, which
> > apparently some of you and put in and others of us
> > cannot. My solution, as things stand now, is to
> > download a txt file, reject it, and re-submit it
> as
> > an
> > rtf file -- which means someone else will then
> have
> > to
> > validate it.
> >
> > But you bring up something I've been wondering
> > about:
> > Should a book that is spell-checked only, and
> > garbage
> > removed, be approved as being in excellent
> > condition,
> > or good? What about scanning errors that pass the
> > spell-check, like "be" for "he," the number one
> for
> > capital I, lie for the, etc. And words that are
> > missing completely from sentences. I know the
> > excellent rating allows for some errors, but how
> > many
> > before it becomes Good instead of Excellent? I
> > recently worked in a book that had a lot of
> missing
> > words. I would suspect that the omissions wouldn't
> > have made much difference to the reader, and I
> > suppose
> > that in the cases of the other examples I gave any
> > reader could make changes as he/she read, but I
> > wonder
> > if it wouldn't be better for books that haven't
> been
> > read and corrected by the validator to have a Good
> > rating and leave the Excellent for books that have
> > been done more carefully.
> >
> > Cindy
> > --- Guido Corona <guidoc@xxxxxxxxxx> wrote:
> >
> > > I know this will sound so dreadfully heartless,
> > no
> > > Charitable Seasonal
> > > spirit and all the rest.  But It takes a grand
> > total
> > > of just 1 hour and 5
> > > minutes to scan an entire 450 page paperback
> book,
> >
> > > page breaks, font
> > > info, and all the rest.  Than it takes about 90
> > > minutes to do some basic
> > > cleanup, and finally an average of a couple of
> > hours
> > > to spellcheck it.
> > >
> > > I really do not understand why we are even
> > bothering
> > > to discuss salvage
> > > operations for DOA submissions,  when the
> culling
> > ax
> > > and a quick rescan is
> > > the only merciful course of action for most of
> > these
> > > runts.
> > >
> > > Guido Dante Corona
> > > IBM Accessibility Center,  Austin Tx.
> > > Research Division,
> > > Phone:  512. 838. 9735.
> > > Email: guidoc@xxxxxxxxxxx
> > > Web:  http://www.ibm.com/able
> > >
> > >
> > >
> > >
> > > "Marissa Mika" <Marissa.M@xxxxxxxxxxxx>
> > > Sent by: bksvol-discuss-bounce@xxxxxxxxxxxxx
> > > 12/28/2004 05:35 PM
> > > Please respond to
> > > bksvol-discuss
> > >
> > >
> > > To
> > > <bksvol-discuss@xxxxxxxxxxxxx>
> > > cc
> > >
> > > Subject
> > > [bksvol-discuss] Re: txt page breaks redux
> > >
> > >
> > >
> > >
> > >
> > >
> > > Hi Cindy,
> > >
> > > We're still working on it. (Gotta love
> consensus,
> > > huh?) Look for a
> > > message from me by the end of the week.
> > >
> > > Did everyone have a good Christmas?
> > >
> > > Marissa
> > >
> > > -----Original Message-----
> > > From: bksvol-discuss-bounce@xxxxxxxxxxxxx
> > > [mailto:bksvol-discuss-bounce@xxxxxxxxxxxxx] On
> > > Behalf Of Cindy
> > > Sent: Wednesday, December 22, 2004 9:21 PM
> > > To: bksvol-discuss@xxxxxxxxxxxxx
> > > Subject: [bksvol-discuss] txt page breaks redux
> > >
> > > Hi, Marissa,
> > >
> > > Thanks for the new list.
> > >
> > > Is there any word yet on what to do with txt
> files
> > > or
> > > if they will be accepted without hard breaks,
> with
> > > spaces and page numbers instead? That doesn't
> > > prevent
> > > the breaks Word puts in in the wrong places, but
> > by
> > > adding line spaces or changing font the file can
> > > probably be made to coincide with the book.
> > >
> > > When I finish Johnny Tremain I'm thinking of
> > fixing
> > > one of those troublesome romances, since I found
> a
> > > copy. As things stand now, I think  the best
> thing
> > > for
> > > me to do is to reject the txt file and submit a
> > new
> > > rtf file with page breaks.
> > >
> > > Cindy
> > >
> > >
> > >
> > >
> > >
> > > __________________________________
> > > Do you Yahoo!?
> > > Take Yahoo! Mail with you! Get it on your mobile
> > > phone.
> > > http://mobile.yahoo.com/maildemo
> > >
> > >
> > >
> > >
> >
> >
> > __________________________________________________
> > Do You Yahoo!?
> > Tired of spam?  Yahoo! Mail has the best spam
> > protection around
> > http://mail.yahoo.com
> >
> >
> >
> 
> 
> 
> 
> __________________________________
> Do you Yahoo!?
> Send a seasonal email greeting and help others. Do
> good.
> http://celebrity.mail.yahoo.com
> 
> 
> 
> 




                
__________________________________ 
Do you Yahoo!? 
Yahoo! Mail - 250MB free storage. Do more. Manage less. 
http://info.mail.yahoo.com/mail_250

Other related posts:

  • » [bksvol-discuss] imges to text