bestbefore 2006-12-11 05:50
zdic 字典製作 - 如何自動分割? 已完成, 分享 longman language activator 中
女友在讀翻譯, 找了很久都沒有看見palm上能用的longman language activator
把心一橫, 想為她弄一個
下載了longman 的 Contemporary 4th edition, 再從其中取出language activator, 轉換成html
由html用ultraedit除去多餘的html tag
最後轉換成kdic/zdic的格式
無奈的是, language activator的格式, 是每一個單字有很多個衍生的解釋
因為kdic/zdic每個單字例句的4K的限制, 無法完整的轉換
找了很久,看了很多的, 嘗試用前的作的(如為wiki寫的程式, 但都不適合), 因為不懂編程, 不懂得如何每4K自動斷行
懇請高手賜教~
____________________________________________
[b]已攪定~[/b]
[b]謝小帥貓, 還有就是kav2376弄的工具[/b]
[b]~[/b]
[b]圖片, 鏈結, 介紹在第二頁 #14, #15, #16[/b]
[[i] 本帖最后由 bestbefore 于 2006-12-12 21:44 编辑 [/i]]
bestbefore 2006-12-11 05:51
以下是 "About"這一個單字的解釋, 可見每個單字詳細的程度, 對學習英語的朋友應該有很大的幫助
ABOUT
related concepts:
talk about see DISCUSS
ask about see ASK
approximately a number or amount see ABOUT/APPROXIMATELY
around a place or thing see AROUND/ROUND
connected with something or someone see CONNECTED/RELATED
***
1 - about a subject or person
about, on, concerning/regarding, on the subject of, re, with regard to, wrt
2 - to have something as the main subject
be about, deal with/be concerned with, concern, focus on
***
1 - about a subject or person
about
/`bat/ [preposition]
·Toby talks about you all the time.
·This leaflet should answer all the questions you have about switching to digital TV.
·I'm reading a really good book at the moment - it's about the French Revolution.
·What's all the fuss about?
about what/how/where etc
·I've been thinking about what you said, and I've decided that you're right.
on
/n || :n, :n/ [preposition] about a particular subject
·Lucia Piatti has written several books on the subject.
advice/opinions/ideas etc on
·If you want any advice on where to stay, Jan should be able to help you.
·His views on women are really old-fashioned.
on what/how/where etc
·Do you have any ideas on where to eat tonight?
book/programme/film etc on
·a book on 19th century English Literature
·a lecture on Native American customs and folklore
·Did you see that programme on South Africa last night?
concerning/regarding
/kn`s:&xR;n, r`g:&xR;d/ [preposition] formal about something or someone - use this especially to talk about information, ideas, or questions
·For any details concerning a particular country you should check with the embassy.
·Richard was called in by the police to answer questions concerning the disappearance of Thomas Ripley.
·If you have any questions regarding any of our services, please feel free to call me.
·Thank you for your letter regarding the annual subscription to our magazine.
on the subject of
/n `sbdkt v / [preposition] if you talk or write on the subject of someone or something, you talk or write about them
·Marie said nothing at all on the subject of Mr Bertram.
while we're on the subject
(=used when you are talking about a subject and want to say more about it)
·While we're on the subject of money, do you have that $10 you owe me?
re
/ri:/ [preposition] used in short business letters, messages etc
·Re planning meeting on Friday, please bring sales figures.
·To: John Deacon. From: Maria Soames. Re: computer system.
with regard to
/w r`g:&xR;d tu:/ [preposition] formal used especially to introduce a subject that you want to talk about in a speech, formal report, meeting etc
·With regard to the proposed new shopping mall, I would like to add a few remarks to those of the previous speaker.
·With regard to your letter concerning my January payment, this matter has now been settled.
wrt
use this in e-mails to introduce the subject you want to talk about. Wrt is an abbreviation of "with regard to"
·Wrt your looking out for a new computer, I saw a good offer in the paper yesterday.
·I'll be contacting Jean tomorrow wrt the Christmas party - is there anything you'd like me to suggest to her?
2 - to have something as the main subject
be about
/bi: `bat / [verb phrase]
·The play is about life in the Deep South in the 1930s.
·What's tonight's documentary about?
·Have you ever read 'Hideous Kinky'? It's about an English woman and her daughter travelling in Morocco.
deal with/be concerned with
/`di:l w , bi: kn`s:&xR;nd w / [transitive phrasal verb/verb phrase] to be about a particular subject, especially a serious one, and examine it carefully and in detail. Deal with is much more common than be concerned with
·The story deals with the psychological conflicts between mother and son.
·The earliest films made in India dealt with mythological subjects.
·Elton's books are often concerned with environmental issues.
concern
/kn`s:&xR;n/ [transitive verb] if a story, report etc concerns someone or something, it is about them and especially about what happens to them
·Much of the material in her early letters concerns events which happened some years before.
focus on
also centre on British/center on American /`fks n, `sentr n / [verb phrase] if something such as a book, article, or discussion focuses on a subject or problem, it is mainly about it and examines it in detail
·The next chapter will focus on this problem in greater detail.
·a weekly magazine mainly focusing on business-related issues
·The controversy centred on the question of illegal arms sales.
王 2006-12-11 10:41
为了女友,学会编程
她一定会被你感动的……
小帅猫 2006-12-11 11:47
楼主说的是朗文当代第4版吧……踏网嬉游早在两年前就转好了,长词条也做了处理(xxx, xxx.2, xxx.3)见我的贴子:http://bbs.tompda.com/viewthread.php?tid=2116075
还有,楼主可以看看我发的Collins(是进行长词条处理,并加了用法和同义词词典的超完整版),该词典特点是用完整句子来解释单词,很多人认为比牛津和朗文好……在上面的贴子上也有链接。
[[i] 本帖最后由 小帅猫 于 2006-12-11 11:52 编辑 [/i]]
bestbefore 2006-12-11 13:33
[quote]原帖由 [i]王[/i] 于 2006-12-11 10:41 发表
为了女友,学会编程
她一定会被你感动的…… [/quote]
王版主
我也有閃過一絲的念頭
在網上看了看教程
再看看我的畢業論文.....
我想我的教授不會因為感動而讓我畢業呢~:H:
bestbefore 2006-12-11 13:50
[quote]原帖由 [i]小帅猫[/i] 于 2006-12-11 11:47 发表
楼主说的是朗文当代第4版吧……踏网嬉游早在两年前就转好了,长词条也做了处理(xxx, xxx.2, xxx.3)见我的贴子:[url=http://bbs.tompda.com/viewthread.php?tid=2116075]http://bbs.tompda.com/viewthread.php?tid=2116075[/url]
还有,楼主可以看看我发的Collins(是进行长 ... [/quote]
再下了XD的collins, 果然不錯, 在此先謝了
~
不過應該是不同的
再舉一個例子
[b][size=3]DRAW[/size][/b]
[b]related words:[/b]
[i]see also [/i][b][color=#0000ff]PAINT[/color],[/b][b][color=#0000ff]PICTURE[/color],[/b][b][color=#0000ff]DESIGN[/color],[/b][b][color=#0000ff]ART/CULTURE[/color],[/b][b][color=#0000ff]PATTERN[/color][/b]
***
[b]1 [/b][b][color=#0000ff]to draw a picture, pattern, line etc[/color][/b]
[indent][b]draw[/b], [b]sketch[/b], [b]doodle[/b], [b]scribble[/b], [b]trace[/b][/indent]
[b]2 [/b][b][color=#0000ff]something that you draw[/color][/b]
[indent][b]drawing[/b], [b]sketch[/b], [b]doodle[/b], [b]scribble[/b]
[/indent][align=left]我在PALM可以找到的字典上, 還沒有找到可以查 DRAW這一個字就給上 Doodle這一個字的解釋[/align][align=left][/align][align=left]Language Activator它的構造比較特別[/align][align=left]我轉了之後才發現只有800多個單字[/align][align=left]但你在巿面上看到的一本 longman longuage activator不會比一本theaurus 薄, [/align][align=left]因為它每一個單字都詳列了它不同的用處[/align][align=left][/align][align=left]好像Draw一般會解作畫圖/畫畫等[/align][align=left]但如果我想要表達開會時很無聊[/align][align=left]拿著筆在亂畫的時候[/align][align=left]我查activator可以找到 doodle : to draw shapes, lines, or patterns without really thinking about what you are doing [/align][align=left]可以傳神的表達出真正的意思[/align][align=left][/align][align=left]XD說的朗文當代第4版我也有, 不過可能因為activator比較偏門, 所以當初就沒有收進去 (因為我在轉光碟時發現每個檔案是分開的)[/align][align=left][/align][align=left]謝謝XD的回答[/align][align=left]可以說說你在做collins COBUILD的時候是如何處理過長的單字嘛?:F: [/align]
[[i] 本帖最后由 bestbefore 于 2006-12-11 13:52 编辑 [/i]]
小帅猫 2006-12-11 15:21
实际上我做的Collins就是这样的啊……XD可以看看draw的第一条解释,再看看draw @T给出的第一条同义词,可是完全对应的哦……是不是和你说的差不多呢?
至于长词条问题,我是用的这个贴子里kav2376的工具,在第9页的最下面,1.02A版本:
[url]http://bbs.tompda.com/viewthread.php?tid=2100362&extra=page%3D1&page=9[/url]
用法是把要转的文件放在C:\下,改名成A.TXT,运行程序,生成B.TXT,就是切好的文件了……
这个工具切出来的可以说几近完美。我的Collins不是非常完美的原因:我用原来的音标版代替了切好的版本中的01和小于4089字节的单词(因为我下的字典原文本里不带音标),所以会出现02还会显示上一页解释中的最后几句,而不是完全紧跟着衔接的。好在是多,不是少(这个工具是按4000字节一切)
dpa 2006-12-11 21:08
[quote]原帖由 [i]小帅猫[/i] 于 2006-12-11 11:47 发表
楼主说的是朗文当代第4版吧……踏网嬉游早在两年前就转好了,长词条也做了处理(xxx, xxx.2, xxx.3)[/quote]
据我所知,踏网嬉游的朗文当代第4版长词条并没有做处理,仍然会被截断。(xxx, xxx.2, xxx.3)只是同一词的不同词性。
希望能有大侠出手解决这个问题。。。如此著名的一部词典却是残缺的?!
[[i] 本帖最后由 dpa 于 2006-12-11 21:17 编辑 [/i]]
wangluzhou 2006-12-11 22:27
[quote]原帖由 [i]王[/i] 于 2006-12-11 10:41 发表
为了女友,学会编程
她一定会被你感动的…… [/quote]
哇~~发现老王的思想很浪漫一点都不输给年轻人呢!:X: 当年追mm肯定很有一手吧?:D:
bobgreen 2006-12-11 22:31
人家老王天天和年轻人在一起,思想当然与时俱进啦
bestbefore 2006-12-12 00:38
[quote]原帖由 [i]小帅猫[/i] 于 2006-12-11 15:21 发表
实际上我做的Collins就是这样的啊……XD可以看看draw的第一条解释,再看看draw @T给出的第一条同义词,可是完全对应的哦……是不是和你说的差不多呢?
至于长词条问题,我是用的这个贴子里kav2376的工具,在第9页 ... [/quote]
先謝謝回覆~
看到了collins的解釋, 有差不多的字, 但是沒有相對同義詞的用法啊
其實即使有點相同也沒有所謂, 多一個字典與大家同享, 讓不同人多一個選擇, 也不是一件壞事啊~
我想長字句的問題應該煩擾到不少想弄字典的XD,
如果有個明確的解決方法, 也方便日後新版字典的製作:Y: ~
XD提及kav2376弄的工具我早前亦有試過, 不過不是你提及的版本
剛試了第9頁最下的一個
有進展, 每個單詞分割了, 但格式有點奇怪, 每個單字消失了
....
有點不好說明...:(:
即是, About, 不是變成ABOUT1 ,ABOUT2, 而是變成了 01, 02
字詞後面是否完整還未及較對
但想問問XD製作Collins時未處理的檔案格式
是否
單字tab解釋.....\n......................enter
如此?
因為我是用上述的格式為A.txt的
附上未轉換的A.txt 己及轉換後的B.txt
bestbefore 2006-12-12 00:51
[quote]原帖由 [i]dpa[/i] 于 2006-12-11 21:08 发表
据我所知,踏网嬉游的朗文当代第4版长词条并没有做处理,仍然会被截断。(xxx, xxx.2, xxx.3)只是同一词的不同词性。
希望能有大侠出手解决这个问题。。。如此著名的一部词典却是残缺的?! [/quote]
如果有辦法解決的話, 我可以順手轉換多一次朗文當代第4版, 反正源文件都在
~
小帅猫 2006-12-12 07:55
这个工具似乎只认“标准”的kdic格式,即:xxx(单词) /// (空格///空格)xxxxxxy(解释)\n(换行)……
楼主可以替换一下看看……
[[i] 本帖最后由 小帅猫 于 2006-12-12 07:57 编辑 [/i]]
bestbefore 2006-12-12 20:00
[quote]原帖由 [i]小帅猫[/i] 于 2006-12-12 07:55 发表
这个工具似乎只认“标准”的kdic格式,即:xxx(单词) /// (空格///空格)xxxxxxy(解释)\n(换行)……
楼主可以替换一下看看…… [/quote]
原來如此
一向都以為kdic的格式為
xxx單詞 TAB xxx解釋\n換行
gendic以此格式都沒有問題
原來"標準"的為 ///
受教了
一試, 果然成功
先上幾個圖
[[i] 本帖最后由 bestbefore 于 2006-12-12 20:03 编辑 [/i]]
bestbefore 2006-12-12 20:19
略為解釋一下格式吧
單詞
related word / related concept <<相關的字詞/範籌 (大寫的應該可以跳查)
*** << 分割
[1] - xxx 第一個用法 [xxx]為相近字詞
.
.
.
xxx << 分割
[1], 第一個用法的詳解
{xxx}第一個在這範籌的相近字的詳解/用法
*eg: 後的為例句
整體格式有點複雜
但礙於字典的型式, 實在想不出更好的表遠方法了
在此再謝過小帅猫 的幫忙
以後跟女友在圖書館溫書時, 不用帶本本也可以查到字典了:Y:
轉換後大小為
5.33 MB (5,592,717 位元組)
我試試分割為附件上載
[[i] 本帖最后由 bestbefore 于 2006-12-12 22:44 编辑 [/i]]
bestbefore 2006-12-12 20:32
轉換朗文當代的時
看見了還有其他的文件產生
完成了activator
原來一個是朗文的collocation
叧一個是example and pharses
源文件都很大
(當代的為 25M 多, Activator 近13 M, Collocation 95M多, Example & Pharses 為71M多)
看到collocation也不錯
就順手轉換了
不過由於格式不盡相同
有點雜亂, 有XD想要, 留言再分享吧 (因為轉換成 zdic格式後也有45M多, 要很多久啊)
不過有言在先這個的格式真的很亂, 因為 activator轉換很快, 修改了很多次啊
4x9 2006-12-12 22:14
第十一个文件怎么没有?
望补上
bestbefore 2006-12-12 22:46
[quote]原帖由 [i]4x9[/i] 于 2006-12-12 22:14 发表
第十一个文件怎么没有?
望补上 [/quote]
在香港啊
上載很慢啊~
IE掛掉了也不覺~
:O:
補上了
txican 2006-12-13 00:34
[url]http://www.live-share.com/files/115263/Activator.pdb.html[/url]
小帅猫 2006-12-13 10:42
不用谢,我应该代表网友们谢谢你,感谢你弄出了这么好的东西……