[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [seul-edu] Re: perl or......

The compilation of word lists and indexes is quite important to
all within the academic community. From my experience few programs
handle this well. Adobe to Microsoft. It seems that Linux may have
some solutions. (This is my justification for taking your time.)

I have books, reading books, workbooks, etc where I need to check how
frequent words appear, make indexes of words used, and check where
special words are used.

Examples....I am currently rewriting spelling words and vocabulary words 
for all grades. (Currently, 1-6) What I do now is to make a list of 
words on a spreadsheet...example ab....away....abscond abnormal etc.
I also have lists of most common English words. I will take these lists 
and have a secretary compile an index using Word Perfect.

It would be nice to find out how frequently certain roots are used and 
then use simple and frequently used words for earlier grade spelling 
lists. How could I index multiple books? How could I make an index which 
would show where in the text the word appears? If the indexing shows a 
poor etymology...example of slapped as you mention, how could I remove 
that word. It would seem impossible to list all possible exceptions but 
it would be nice to easily edit the indexes.

I don't mind studying anything to do such work as my lists become more
complicated every year.

S. Barret Dolph
Taipei Taiwan

PS. Thank you for your support.

Ray Olszewski wrote:

> At 04:51 PM 12/3/00 +0800, S. Barret Dolph wrote:
>> I would like to be able to check some books for root words.
>> Example.... look for occurances of ped, cor, cit, in swift.txt
>> Would it be possible with Perl, or anything else, to find the occurances 
>> of a list of words, show where those words are, and save this task to a 
>> file?
> Others have already responded that "grep" will do this for you. But the
> output of grep will be fairly crude and hard to interpret. A more detailed
> description of what you want to accomplish might lead to better, more
> focused feedback.
> Please do not misunderstand me: the other suggestions are correct.I just
> suspect they will not do the unstated thing that you really want to accomplish.
> For example, take "ped". It is a root in "orthopedic" but not in "slapped".
> It takes more than grep to make that distinction; it takes a lot of design
> work, and I don't know of anything in the free/Open Source software world
> that operates at that level of sophistication. 
> --
> ------------------------------------"Never tell me the odds!"---
> Ray Olszewski                                        -- Han Solo
> Palo Alto, CA           	 	         ray@comarre.com        
> ----------------------------------------------------------------