Classifier. The classifier tries to identify the query-topic using query-terms and training data consisting of topic- descriptions discussed above. In our implementation we use a multinomial NBC, with maximum likelihood estimates to determine the topic probabilities of the query. For a query q, we compute the probability of membership of the query for different topic-classes as, P (c |q) = P (q|ci) Γ P (ci) β P (c ) Y P (q |c ) (6) where TSRki is the topic-sensitive SourceRank score of source sk for topic-class ci. CSRs give the query- topic sensitive SourceRank for all deep-web sources. Since CSR is computed during query-time, it is im- portant that its processing time is kept to a minimal. CSR will be used in conjunction with a relevance mea- sure as described below. Hence CSR computation can be limited to selected top-k most relevant topics.
Appears in 2 contracts
Sources: Source Selection Agreement, Source Selection Agreement