jump to navigation

Webicina search: Interview about Semantic indexing August 30, 2011

Posted by Dr. Bertalan Meskó in Health, Health 2.0, Medical Search, Medicine, Medicine 2.0, Web 2.0, Webicina.
trackback

Now that Webicina.com has a new design and a brand new search engine, I thought it would be useful to show the basic features and I also publish an interview with Endre Jóföldi, CEO at WebLib LLC, the company behind the search engine of Webicina.com.

If you do a search for diabetes, you will see

  • whether Webicina has a diabetes resource (a blog, podcast, Facebook group, Twitter user, etc), collection (Diabetes and Web 2.0) or sub-section (Diabetes Mobile Apps).
  • You can narrow the search by “social media collection” (e.g. a diabetes resource in the asthma collection), “curated dynamic news category” (e.g. news categories featuring diabetes resources), “resource type” (RSS, resource, subsection or collection) or “languages”.

  • Please tell us more about WebLib! What kind of projects are you famous for?

Generally we are building intelligent search solutions for our customers like the NIH or Vanderbilt University Biomedical Library. Behind the scenes we are also working on a semantic web knowledge base to improve our core search engine.

  • What is the engine behind the Webicina search function like?

Our PolySearch enterprise search engine is a best-of-class domain independent semantic indexing and search solution built on top of the open source Solr/Lucene enterprise search platform. We have built several different search solutions based on this engine, which are all different in some sense based on the different needs of our customers.
PolySearch uses grammatical tools to find different forms of words and also uses our SearchComplete to offer health related search queries when the user typing his/her queries. It also utilize our medical spell checker to help correcting typos.

  • How does clustering search help us find exactly what we need?

We face the problem of information overload after doing searches. For example if you search for cancer on Webicina  you will get more then 40 thousand results. Our relevance algorithm floats up the most important Webicina resources, however if you want to look through the tons of RSS results, it definitely helps to filter it down to your area of interest. Like you will see only 479 results in the Nutrition category. This is a much more friendly number.

  • How difficult was it to develop a search engine for Webicina.com?

We tried to make a very fast engine, what helps Webicina visitors to find all the information Webicina treasures. It is also handling the different resource types and languages so we think it can really improve the user experience.  More than that we have still plans how to make it even better, so it is going to be a longer process, where we want to use the data how the visitors are going to use this search service.

Comments»

No comments yet — be the first.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Connecting to %s

Follow

Get every new post delivered to your Inbox.

Join 134 other followers