Replies: 2 comments
-
The result is as expected. XML/HTML documents are parsed as plain text, so your search is reduced to the plain word token `html`. The `<` and `>` characters are not tokenized, so enclosing the query in double quotes won't help either: `<` and `>` are simply not in the index.

The XML analyzer could be improved so that it recognizes tags. Or, to go even more generic, the whole "full search" could be improved now that we have a more powerful Lucene underneath, so that it indexes really "full" text.

So, to answer your question: yes, it is possible, but you will get some false positives because you are searching by token, not by full text (don't confuse OpenGrok's Full Search with full-text search!).
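To illustrate why the token search produces that false positive, here is a minimal sketch in Python. It is not OpenGrok's or Lucene's actual tokenizer; `plain_tokens` is a hypothetical stand-in that only assumes punctuation such as `<` and `>` is dropped and words are lowercased.

```python
import re

def plain_tokens(text):
    # Hypothetical plain-text tokenizer: keep runs of word characters,
    # lowercase them, and discard punctuation such as '<', '>', '/' and '"'.
    return [t.lower() for t in re.findall(r"\w+", text)]

query = "<html>"
line = '<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />'

query_tokens = plain_tokens(query)
line_tokens = plain_tokens(line)

# The query collapses to the single token 'html', which also occurs in
# the content attribute of the <meta> line, so the line counts as a hit.
is_hit = all(t in line_tokens for t in query_tokens)
print(query_tokens, is_hit)
```

Under these assumptions the query and the `<meta>` line from the question share the token `html`, which is why that line shows up in the results even though it contains no `<html>` tag.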
-
Short term, I guess creating our own XML tokenizers (instead of using the generic plain-text ones) might get us where we want. I'm not sure whether we need to modify full search for every single file.
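A rough sketch of what a tag-aware tokenizer could look like, again in plain Python rather than as a real Lucene tokenizer. The function name `tag_aware_tokens`, the regular expression, and the choice to emit tag names as `<name>` tokens are all assumptions made for illustration, not anything OpenGrok provides.

```python
import re

# Group 1 captures a tag name that directly follows '<' or '</';
# group 2 captures any other plain word.
TOKEN_RE = re.compile(r"</?\s*([A-Za-z][\w:.-]*)|(\w+)")

def tag_aware_tokens(text):
    # Hypothetical tokenizer: tag names become distinct '<name>' tokens,
    # everything else is tokenized as lowercase plain words.
    tokens = []
    for m in TOKEN_RE.finditer(text):
        if m.group(1):
            tokens.append("<" + m.group(1).lower() + ">")
        else:
            tokens.append(m.group(2).lower())
    return tokens

line = '<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />'

print(tag_aware_tokens("<html>"))
print("<html>" in tag_aware_tokens(line))
```

With tags indexed as their own tokens, a query for `<html>` would no longer match the word `html` inside an attribute value, while a plain search for `html` would still find it. A real implementation would also have to handle comments, CDATA, and `<` used as an operator in PHP or JSP code, which this sketch ignores.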
-
We have a webapp project (that is, HTML, PHP, or JSP files) and would like to search for usages of a given tag in those files.
We tried typing `<html>` in the Full Search field and selecting XML in the Type field. It shows the query parsing result to be `+full:html +type:xml`, and the result set contains some irrelevant records, such as:
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
So is it currently possible to search for an HTML/XML tag?
FYI, we are using 0.12-rc2.
Thank you for your work, this tool is really useful!