| 19 | |
| 20 | ---- |
| 21 | |
| 22 | === Analyzer === |
| 23 | |
| 24 | What filters should we use in our Analyzer? At a minimum we need !StandardTokenizer, !StandardFilter, and !LowerCaseFilter. |
| 25 | |
| 26 | Will we use a !StopFilter, and if so, how do we decide which (language-dependent) stopword list to use? |
| 27 | |
| 28 | Do we use an NGramTokenFilter to help fuzzy searches? How is this better than using !FuzzyQuery at query time? |
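To make the n-gram question more concrete, here is a minimal plain-Java sketch of what indexing character n-grams means for a single token. It does not use the Lucene API: the class `NGramSketch` and the method `ngrams` are hypothetical names made up for illustration, and the gram sizes 2 to 3 are an assumption, not a recommendation.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene: lists the character n-grams
// that an n-gram style filter would add to the index for one token.
final class NGramSketch {

    // Returns every substring of `token` whose length is between minGram
    // and maxGram, ordered by length first and start offset second.
    static List<String> ngrams(String token, int minGram, int maxGram) {
        List<String> grams = new ArrayList<>();
        for (int n = minGram; n <= maxGram; n++) {
            for (int start = 0; start + n <= token.length(); start++) {
                grams.add(token.substring(start, start + n));
            }
        }
        return grams;
    }

    public static void main(String[] args) {
        // prints [lu, uc, ce, en, ne, luc, uce, cen, ene]
        System.out.println(ngrams("lucene", 2, 3));
    }
}
```

A misspelled query such as "lucine" still shares the grams "lu", "uc", "ne" and "luc" with "lucene", so it can match through ordinary term lookups. The price is a larger index (nine extra terms for one six-letter token here), whereas a fuzzy query leaves the index untouched and does its comparison work at query time. Which side of that trade-off suits us is exactly the open question.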
| 29 | |
| 30 | Do we use an ISOLatin1AccentFilter to abstract over accented characters? (heikki: +1) |
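For reference, the effect of abstracting over accented characters can be approximated in plain Java with Unicode decomposition. This is only a sketch of the idea, not how ISOLatin1AccentFilter is implemented; `AccentFoldSketch` and `fold` are hypothetical names.

```java
import java.text.Normalizer;

// Hypothetical helper, not part of Lucene: strips diacritics by
// decomposing characters (NFD) and dropping the combining marks.
final class AccentFoldSketch {

    static String fold(String input) {
        // NFD splits e.g. U+00E9 (e with acute) into 'e' + U+0301.
        String decomposed = Normalizer.normalize(input, Normalizer.Form.NFD);
        // \p{M} matches the combining marks left behind by decomposition.
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        // "caf\u00e9" is "cafe" with an acute accent on the final e.
        System.out.println(fold("caf\u00e9")); // prints cafe
    }
}
```

Note that characters without a canonical decomposition, such as U+00F8 (o with stroke), pass through unchanged with this approach, so it is not a drop-in replacement for a filter built on an explicit mapping table. Whatever folding we choose has to be applied both at index time and at query time, otherwise accented queries will not match folded terms.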
| 31 | |
| 32 | Do we use a !PhoneticFilter? If so, how would it work across different languages? |
| 33 | |
| 34 | Do we use a !SynonymFilter? The language-dependence issue applies here as well. |
| 35 | |
| 36 | Do we use a !SnowballFilter (stemming)? Again, how will we deal with the different languages? |
| 37 | |
| 38 | |
| 39 | |
| 40 | |
| 41 | |