| wang 的个人资料Nirvana照片日志列表 | 帮助 |
|
1月21日 Some ScheduleGood life, good paid. Need to scratch out a schedule to keep myself on the right track.
1. The firest priority is Qualify Exam, I would like to explore the possibility of Syntax Level Search
What i have done,
A. Collect enough test data
B. So so NLP Parser, but not good.
C. One possible way of Storage
D. Nutch's Index Structure
E. The relationship between storage on data repository and Database
What need to do
A. Deal with question query, not only transform into keywords ,but also give a reasonable query expanding technique.
A1. Data preprocess: Change Sentence Format/ and maybe more, Need careful think!
B. Keyword Search have page rank, is there a counterpart in Syntax Search?
C. Index Schema on DBMS level storage
D. Possibility of data repository and if possible how to deal with the index, XML formate storage's adv and disAdv.
E. Answer Engine VS Search Engine, is the an integration method?
Something industry interested
1. How to design distributed crawler to handle huge data
Actually, i don't know too much on how to implement a crawler!
2. How to design index technology facing huge data, fast and stable
3. The definition of scalability
4. Good C/CPP capability to implement an indexer
5. Semantic Search ?
6. A lot , i dont know, need to figure out
Need to finish main outline of the Research Plan, especially with Yellow Items. Leave room for extension.
2. Techique
2.1. I want to learn a little about linux network programming
Need to read some text book, I know what to do, but don't be lazy
Possiblly, i need some help!
2.1. Read the book about Debug.
3. IR and Search
3.1 Some paper on D:/Search/paper
3.2. BM25
4. Interested in Compiler
Read zhaomin's compiler book, thanks zhaomin.
5. Spend some spare time on joujou's PS and RL.
6. No surfing on Web, instead, watch TV and play guitar.
----------------------------------------------------------------------------------------
Live is a good band, make you feel easie. Enjoy living with you sole.
|
|
|