1 2011-04-16T00:05:01  *** MattMaker has quit IRC
   2 2011-04-16T02:29:37  <CIA-25> Thomas Waldmann <tw AT waldmann-edv DOT de> default * 178:2f1708707657 2.0/quickinstall: quickinstall now uses curl or wget (fix for no wget on Mac OS X, thanks to Graham Dumpleton)
   3 2011-04-16T02:29:40  <CIA-25> Thomas Waldmann <tw AT waldmann-edv DOT de> default * 179:939013ce69fe 2.0/quickinstall: quickinstall: simplify
   4 2011-04-16T03:15:33  *** cosmo_ has joined #moin-dev
   5 2011-04-16T03:17:09  *** cosmo has quit IRC
   6 2011-04-16T05:28:48  *** MattMaker has joined #moin-dev
   7 2011-04-16T05:29:19  *** z^ck has quit IRC
   8 2011-04-16T05:29:30  *** z^ck has joined #moin-dev
   9 2011-04-16T06:36:21  *** MattMaker has quit IRC
  10 2011-04-16T07:08:03  <dreimark> moin
  11 2011-04-16T08:13:28  *** Marchael has joined #moin-dev
  12 2011-04-16T08:13:36  <Marchael> moin
  13 2011-04-16T11:16:22  *** Marchael has quit IRC
  14 2011-04-16T11:32:55  *** Marchael has joined #moin-dev
  15 2011-04-16T11:46:10  *** ronny has joined #moin-dev
  16 2011-04-16T12:52:48  <ThomasWaldmann> moin
  17 2011-04-16T13:03:01  *** greg_f has joined #moin-dev
  18 2011-04-16T13:34:55  *** m4k3r has joined #moin-dev
  19 2011-04-16T13:47:48  *** ronny has quit IRC
  20 2011-04-16T14:01:38  *** ronny has joined #moin-dev
  21 2011-04-16T14:13:32  *** ronny has quit IRC
  22 2011-04-16T14:39:00  *** m4k3r has quit IRC
  23 2011-04-16T14:40:05  *** m4k3r has joined #moin-dev
  24 2011-04-16T14:41:20  *** m4k3r has quit IRC
  25 2011-04-16T14:49:12  * dreimark greetings from cologne
  26 2011-04-16T14:49:41  <ThomasWaldmann> ah, right. have fun!
  27 2011-04-16T14:50:03  <ThomasWaldmann> if something moin2ish comes up, point people to this channel.
  28 2011-04-16T14:50:17  <dreimark> i may be have a talk at 16:45
  29 2011-04-16T14:50:27  <dreimark> and try to run the client too
  30 2011-04-16T14:50:41  <ThomasWaldmann> you don't know yet? :)
  31 2011-04-16T14:50:50  <dreimark> currently we are on track
  32 2011-04-16T14:50:57  <dreimark> may be a bit later
  33 2011-04-16T15:23:52  *** andialbrecht_ has quit IRC
  34 2011-04-16T15:51:11  <ThomasWaldmann> Marchael: btw, what qp class would you use for the search box input field processing?
  35 2011-04-16T15:53:54  <Marchael> ThomasWaldmann: What does it mean "qp class"?
  36 2011-04-16T15:54:21  <ThomasWaldmann> whoosh offers misc. queryparser classes
  37 2011-04-16T15:54:32  <Marchael> aah
  38 2011-04-16T15:56:20  <Marchael> If I understand right you mean something like this http://packages.python.org/Whoosh/api/qparser.html#parser-object ?
  39 2011-04-16T15:58:18  <ThomasWaldmann> yes
  40 2011-04-16T16:01:34  <Marchael> So I will use it of corse, but I`m worried about parsing queries like "mimetype:text/html rev_no:1 search query"
  41 2011-04-16T16:02:51  <Marchael> by default I will find all documents which names matches given query
  42 2011-04-16T16:04:30  <ThomasWaldmann> btw, i asked for the class name.
  43 2011-04-16T16:08:14  * Marchael reading description of all classes
  44 2011-04-16T16:10:58  <Marchael> whoosh.qparser.QueryParser
  45 2011-04-16T16:12:02  <Marchael> I`m also reading information about plugins for qp http://packages.python.org/Whoosh/api/qparser.html#plug-ins
  46 2011-04-16T16:18:32  *** MattMaker has joined #moin-dev
  47 2011-04-16T16:19:10  <Marchael> Althought, for searching in multiple fields I`m need whoosh.qparser.MultifieldParser
  48 2011-04-16T16:35:20  <Marchael> ThomasWaldmann: Am I answered to your question?
  49 2011-04-16T16:38:48  <Marchael> I also think what it will be good idea to add FieldAliasPlugin. For example "mimetype" == "mime" or "revision" == "rev"
  50 2011-04-16T16:47:36  <Marchael> xorAxAx: hi
  51 2011-04-16T16:47:43  <Marchael> My apologies what I couldn't answer to your comment on melange soon. Last 3 days I actively working on my coursework, so I had no time to think about answer to your comment.
  52 2011-04-16T16:47:48  <xorAxAx> Marchael: hi
  53 2011-04-16T16:47:56  <Marchael> > How do you think you want to implement the search language? Will you reuse the parser of Whoosh? Will you use a parsing toolkit like they did?
  54 2011-04-16T16:47:56  <Marchael> Search language will be look like http://moinmo.in/HelpOnSearching. I don't want to write my own search language, because whoosh already has whoosh.qparser.QueryParser for this.
  55 2011-04-16T16:47:59  <xorAxAx> thats ok
  56 2011-04-16T16:48:39  <xorAxAx> but woosh doesnt have this language ... do youi want to convert the queries, Marchael?
  57 2011-04-16T16:48:52  <xorAxAx> and how do you want to parse the language explained on that wiki page?
  58 2011-04-16T16:55:01  <xorAxAx> Marchael: ping
  59 2011-04-16T16:55:22  <Marchael> whoosh have many different plugins for parsing search quieries http://packages.python.org/Whoosh/api/qparser.html#plug-ins. Whoosh have similar query language.
  60 2011-04-16T16:55:22  <Marchael> For example if you want to use phrase searching then whoosh will be use whoosh.qparser.PhrasePlugin.
  61 2011-04-16T16:55:22  <Marchael> If you want to all documents starting with "help" but didn`t want to find "helponsearch" expression will be look like "NO helponsearch AND help*". For this whoosh will using whoosh.qparser.WildcardPlugin
  62 2011-04-16T16:56:10  <xorAxAx> so what?
  63 2011-04-16T16:56:16  <xorAxAx> please answer my questions
  64 2011-04-16T16:56:36  <Marchael> etc
  65 2011-04-16T16:57:55  <Marchael> I do not want to convert quieries if whoosh can understand it without convertion
  66 2011-04-16T16:59:13  <Marchael> if you type "query string" on search box whoosh will recognize this as phrase without convertion
  67 2011-04-16T17:02:45  <xorAxAx> but the whole language is NOT understood by whoosh, so what is your plan?
  68 2011-04-16T17:06:53  <Marchael> xorAxAx: you meant language describing on HelpOnSearch?
  69 2011-04-16T17:11:46  <xorAxAx> yes
  70 2011-04-16T17:23:57  <Marchael> Whoosh can understand queries like "content:help". So I`m can design schema with this fields(exclude case and regexp of course)
  71 2011-04-16T17:24:09  <Marchael> sorry for long answers
  72 2011-04-16T17:26:50  <xorAxAx> yeah, you will have cases which whoosh cannot understand. how do you want to handle them?
  73 2011-04-16T17:30:42  <ThomasWaldmann> xorAxAx: btw, i think moin2 is rather different from moin1 anyway, so it is not strictly required that search queries are 100% compatible. of course if it can be made easily to understand the same queries, one would do that.
  74 2011-04-16T17:31:11  <xorAxAx> ThomasWaldmann: yes, its no requirement, indeed, but he wanted to achieve that goal, so he should explain how
  75 2011-04-16T17:31:20  *** franklin has joined #moin-dev
  76 2011-04-16T17:32:03  <Marchael> xorAxAx: can you provide some example of query which whoosh can not understand?
  77 2011-04-16T17:32:43  <xorAxAx> Marchael: you just gave an example
  78 2011-04-16T17:35:27  <ThomasWaldmann> Marchael: yes, MultifieldParser looks good for a combined title search and text search box
  79 2011-04-16T17:37:17  <Marchael> xorAxAx: I do not find how whoosh can perform case sensitive searching and searching by regexp(althought '*' and '?' will be enough I guess)
  80 2011-04-16T17:37:26  <Marchael> s/find/found/
  81 2011-04-16T17:38:03  <xorAxAx> Marchael: then you need to extend whoosh
  82 2011-04-16T17:39:48  <Marchael> althought whoosh support some regexp which contains <, >, >=, <=, =>, or =<
  83 2011-04-16T17:40:39  <ThomasWaldmann> Marchael: in case you mean comparisons / range stuff: that is not regexp
  84 2011-04-16T17:41:27  <ThomasWaldmann> please read some stuff about regular expressions so you'll know better what is one and what is not
  85 2011-04-16T17:42:44  <Marchael> ThomasWaldmann: I`m just copypaste this from there http://packages.python.org/Whoosh/api/qparser.html#whoosh.qparser.GtLtPlugin
  86 2011-04-16T17:49:26  <ThomasWaldmann> they use a regex there for the plugin, but that doesn't make regexes part of the query language
  87 2011-04-16T18:05:49  <Marchael> ThomasWaldmann: I think it will be good idea to write feature request to whoosh ml because I don't know how much time takes writing regexp and case sensitive search support
  88 2011-04-16T18:07:43  <xorAxAx> Marchael: i think that it rather makes sense to implement regex on the moin side of things
  89 2011-04-16T18:08:13  <xorAxAx> same for case sensitive things
  90 2011-04-16T18:10:55  <xorAxAx> e.g. case sensitive search is application of a whoosh search operator and a moin search operator that does a plain fulltext search
  91 2011-04-16T18:11:50  <Marchael> xorAxAx: and what if we have something like this '^[^o0-9]*' ?
  92 2011-04-16T18:12:19  <xorAxAx> then you do a plain fulltext search with that regex
  93 2011-04-16T18:14:46  <ThomasWaldmann> what makes more sense maybe depends what whoosh author wants to have in whoosh and what not
  94 2011-04-16T18:23:09  <Marchael> for case sensitive search I can use following algoritm:
  95 2011-04-16T18:23:09  <Marchael> 1) Search as usual text
  96 2011-04-16T18:23:09  <Marchael> 2) get list with search results
  97 2011-04-16T18:23:09  <Marchael> 3) Use re for each element in list
  98 2011-04-16T18:26:15  <xorAxAx> yes, for example
  99 2011-04-16T18:26:33  <ThomasWaldmann> yes, that'ld work. requires the stuff for re :)
 100 2011-04-16T18:30:28  <Marchael> I'm need to go back home, get back soon :)
 101 2011-04-16T18:30:33  *** Marchael has quit IRC
 102 2011-04-16T19:12:44  *** Marchael has joined #moin-dev
 103 2011-04-16T19:14:22  <Marchael> xorAxAx: Am I answered to your question?
 104 2011-04-16T19:16:09  <xorAxAx> no
 105 2011-04-16T19:16:25  <Marchael> xorAxAx: What is unclear for you/
 106 2011-04-16T19:16:28  <Marchael> ?
 107 2011-04-16T19:16:43  <xorAxAx> you still need to extend whoosh's query syntax to accomodate regex and case searches, you didnt tell us how you plan to do it
 108 2011-04-16T19:18:22  <Marchael> I already wrote about case sensitive
 109 2011-04-16T19:19:01  <xorAxAx> you only described the algorithm, not the query language
 110 2011-04-16T19:21:14  <Marchael> queries will be look like in moin1(case:word)
 111 2011-04-16T19:21:27  <xorAxAx> and how do you parse them?
 112 2011-04-16T19:22:37  <Marchael> using re for analyzing request coming from search box
 113 2011-04-16T19:28:29  <Marchael> For finding regexp in request I also will be using re and if it found regexp then re will be using this regex on search results
 114 2011-04-16T19:30:17  <Marchael> s/re will be using/re will use/
 115 2011-04-16T19:33:31  <xorAxAx> why not a parser toolkit?
 116 2011-04-16T19:33:44  <xorAxAx> easier, easily extensible
 117 2011-04-16T19:35:50  <Marchael> xorAxAx: qparser module?
 118 2011-04-16T19:36:19  <xorAxAx> hmm?
 119 2011-04-16T19:36:37  <Marchael> http://packages.python.org/Whoosh/api/qparser.html#module-whoosh.qparser
 120 2011-04-16T19:36:37  <xorAxAx> a parsing toolkit takes a grammar e.g. in BNF form and generates a parser from it
 121 2011-04-16T19:36:48  <xorAxAx> whoosh uses pyparsing
 122 2011-04-16T19:36:58  <Marchael> aah
 123 2011-04-16T19:40:15  * Marchael reading about BNF
 124 2011-04-16T19:42:32  <ThomasWaldmann> we maybe really first look at how whoosh parses queries before putting a whole new thing on top of this
 125 2011-04-16T19:43:19  <xorAxAx> ThomasWaldmann: yes, hte best idea is to extend the pyparsing desc of it
 126 2011-04-16T19:43:32  <xorAxAx> Marchael: read the pyparsing docs
 127 2011-04-16T19:43:41  <Marchael> ok
 128 2011-04-16T19:43:47  <xorAxAx> we learn about BNF in first semester in germany
 129 2011-04-16T19:44:20  <Marchael> second semestr of 3th curse in Russia :)
 130 2011-04-16T19:44:55  <xorAxAx> "3th curse"?
 131 2011-04-16T19:45:08  <Marchael> *course
 132 2011-04-16T19:45:11  <xorAxAx> 3rd course? what is a semester of a course?
 133 2011-04-16T19:47:21  <Marchael> second
 134 2011-04-16T19:50:33  <ThomasWaldmann> xorAxAx: why are you talking about whoosh using pyparsing?
 135 2011-04-16T19:52:44  <ThomasWaldmann> and are we both talking about whoosh 1.8.2?
 136 2011-04-16T19:54:40  *** greg_f has quit IRC
 137 2011-04-16T19:57:18  <ThomasWaldmann> xorAxAx: looks like you are looking at whoosh < 1.0
 138 2011-04-16T19:57:50  <ThomasWaldmann> "New hand-written query parser based on plug-ins. Less brittle, more robust, more flexible, and easier to fix/improve than the old pyparsing-based parser."
 139 2011-04-16T19:58:05  <ThomasWaldmann> from 1.0 changelog entry
 140 2011-04-16T19:58:33  <ThomasWaldmann> looks like he wasn't that happy with pyparsing
 141 2011-04-16T19:59:49  <xorAxAx> ThomasWaldmann: i was looking at the website
 142 2011-04-16T20:00:16  <ThomasWaldmann> don't use whoosh.ca
 143 2011-04-16T20:00:41  <Marchael> http://packages.python.org/Whoosh/index.html
 144 2011-04-16T20:00:44  <ThomasWaldmann> bitbucket and pypi are much more current
 145 2011-04-16T20:04:34  <ThomasWaldmann> Marchael: xorAxAx: (or anybody else): if you find spelling errors or other minor thing in the whoosh docs, please tell me
 146 2011-04-16T20:05:10  <Marchael> ok
 147 2011-04-16T20:05:30  * ThomasWaldmann wants to feed back such stuff upstream
 148 2011-04-16T20:05:31  <xorAxAx> ThomasWaldmann: i looked at bitbucket
 149 2011-04-16T20:13:13  <ThomasWaldmann> xorAxAx: i fixed the wiki page
 150 2011-04-16T20:13:32  <xorAxAx> ok
 151 2011-04-16T20:28:53  <Marchael> xorAxAx: for parsing queries I can use something like this MultifieldParser(["title", "regexp",...], ix.schema).parse(u'title:none regexp:^[^\!]*')
 152 2011-04-16T20:28:53  <Marchael> this will return parsed request
 153 2011-04-16T20:30:21  <Marchael> looking like And([Term(u'title', u'shit'), Term(u'regexp', u'^[^\!]*')])
 154 2011-04-16T20:30:43  <Marchael> ooops
 155 2011-04-16T20:35:38  <Marchael> I meant u'title', u'none' of corse :)
 156 2011-04-16T20:37:28  <Marchael> *of course
 157 2011-04-16T20:41:24  *** Marchael has quit IRC
 158 2011-04-16T20:41:31  *** Marchael has joined #moin-dev
 159 2011-04-16T21:01:31  *** m4k3r has joined #moin-dev
 160 2011-04-16T21:04:36  <Marchael> xorAxAx: so what you think?
 161 2011-04-16T21:20:55  <ThomasWaldmann> btw, I'ld suggest reading once through the whoosh docs and also through the whoosh src code
 162 2011-04-16T21:41:59  *** Marchael has quit IRC
 163 2011-04-16T21:50:15  <ThomasWaldmann> xorAxAx: still there?
 164 2011-04-16T22:29:37  <ThomasWaldmann> xorAxAx: waldi: i am currently thinking about whether it makes sense to run revision data through dom conversion (data->dom, dom->simple text) before giving it to the search indexing analyser (tokenizer, filters)
 165 2011-04-16T22:32:28  *** humbu has joined #moin-dev
 166 2011-04-16T22:32:35  *** humbu has left #moin-dev
 167 2011-04-16T22:39:37  <waldi> ThomasWaldmann: well. you don't want to feed html data into it usually
 168 2011-04-16T22:40:29  <ThomasWaldmann> i didn't say html
 169 2011-04-16T22:41:03  <ThomasWaldmann> the point is that if we directly try to index revision data, we have 2 issues:
 170 2011-04-16T22:41:34  <ThomasWaldmann> a) different kind of text items (wiki markup, rst, maybe even html, docbook/xml, ...)
 171 2011-04-16T22:41:57  <ThomasWaldmann> b) non-text data (word docs, pdfs, images, ...)
 172 2011-04-16T22:42:15  <waldi> you said "data"
 173 2011-04-16T22:42:26  <ThomasWaldmann> for a) it would be nice to get out text only information
 174 2011-04-16T22:42:34  <ThomasWaldmann> (no markup)
 175 2011-04-16T22:42:58  <ThomasWaldmann> for b) we need text extraction anyways (see MoinMoin.filter in 1.9)
 176 2011-04-16T22:58:08  <dreimark> re
 177 2011-04-16T22:58:10  <dreimark> Rietveld Code Review Tool
 178 2011-04-16T22:58:25  <dreimark> is an interesting tools to get clean commits
 179 2011-04-16T22:58:45  <dreimark> and also less changesets
 180 2011-04-16T22:59:07  <dreimark> at least less fix a fix changesets
 181 2011-04-16T23:07:59  <ThomasWaldmann> the docs tell it is for use with subversion
 182 2011-04-16T23:16:28  <ThomasWaldmann> ah, docs outdated, seems to support mercurial also
 183 2011-04-16T23:19:24  <dreimark> it supports mercurial and it does not need only app engine
 184 2011-04-16T23:20:35  <dreimark> some images from today http://www.flickr.com/photos/onyame/
 185 2011-04-16T23:22:47  <dreimark> http://openetherpad.org/coderview
 186 2011-04-16T23:24:51  <dreimark> at the dinner andi told that there is also a version available which independent from app engine
 187 2011-04-16T23:25:03  <dreimark> need to ask him tom. about the url
 188 2011-04-16T23:41:57  *** MattMaker has quit IRC
 189 

MoinMoin: MoinMoinChat/Logs/moin-dev/2011-04-16 (last edited 2011-04-15 22:45:03 by IrcLogImporter)