Sphinx for best commercial open source project


powered by Sphinx

Sphinx search (www.sphinxsearch.com) is one of the fastest-developing search engines. Sphinx is used by Craigslist, among several other popular sites. If you have not already tried Sphinx and you are interested in building your own search engine, give it a shot. Salient features of Sphinx include easy integration with MySQL, fast indexing, and support for distributed searches.

Support Sphinx and nominate it for the community choice awards!

Which companies are impacted by swine flu?

The swine flu epidemic (leading to a pandemic) has terrified the world. Its impact can be felt across industries, with travel and hospitality being the worst hit. I tried to see which companies report a hit in performance because of swine flu. Using Gridstone’s Search Engine, I searched for swine flu. Here is a sneak peek of the results:

Search results for swine flu using Gridstone Search

Gridstone Search – a SEC filings and transcripts search engine

Gridstone Search for SEC documents and Transcripts

Gridstone Search (http://search.gridstoneresearch.com) is available in beta for all the information seekers who have to sift through SEC documents and company transcripts. Gridstone Search is now loaded with EDGAR-filed documents from 2007 onward (covering the five most important filing types: 10K, 6K, 10Q, 8K and 14A) and with transcripts provided by Seeking Alpha (through a unique partnership).

Welcome the (Yahoo!) BOSS

Yahoo! Inc recently announced the release of BOSS – its open search (web-service based) platform. BOSS (Build your Own Search Service) exposes Yahoo Search through an App ID and a web-service. It will be interesting to see how BOSS matures around two main search engine buzz-words: vertical search and semantic search capabilities. I spent some time trying out BOSS on some of my favorite problems (Yes, I am going to share the URL on my website soon!). Here is a quick catch-up on the ‘cool’ and ‘not-so-cool’ BOSS features.

  1. Vertical Search Engines: It is not easy to build a vertical search engine that uses BOSS to search a few thousand sites. BOSS doesn’t support searching multiple sites (in fact, it handles the request incorrectly if you try to search multiple sites through the web-service), nor does it allow regular expressions for the sites to search. This doesn’t bode well for building your own little vertical search engine. Google’s Custom Search Engine (CSE) scores way higher on this one, with easy configuration of the websites to search and the ability to tweak relevance and other settings. But BOSS has just arrived; I am sure changes will pour in soon.
  2. Semantic Search: Hakia has done it; many more will follow! This is a good value proposition by Yahoo and is primarily aimed at spicing up the horizontal search market. It lets developers and researchers try out their own versions of relevance and mash-ups on top of Yahoo’s rich information source. This, however, operates on a horizontal search model.
  3. Multi-modal Search: Yahoo has done a good job of providing Image and News Search features. Image searches, however, have relatively few use-cases, especially if the search is not based on image content. A Riya-like visual search would have been awesome.
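
To make the vertical-search limitation in point 1 concrete, here is a minimal sketch of the obvious workaround: issue one single-site query per site and merge the results yourself. This is not BOSS client code. The `search` callable, the result shape and the `fake_search` stand-in are all hypothetical, and the sketch assumes the backend honors a `site:` operator in the query string.

```python
from itertools import zip_longest
from typing import Callable, Dict, Iterable, List

Result = Dict[str, str]  # hypothetical result shape: {"url": ..., "title": ...}
SearchFn = Callable[[str], List[Result]]


def site_query(query: str, site: str) -> str:
    """Restrict a query to one site (assumes the backend honors `site:`)."""
    return f"{query} site:{site}"


def merge_round_robin(result_lists: Iterable[List[Result]]) -> List[Result]:
    """Interleave per-site result lists, dropping duplicate URLs."""
    merged: List[Result] = []
    seen = set()
    # zip_longest yields the 1st hit of every site, then the 2nd, and so on;
    # sites that run out of hits are padded with None.
    for tier in zip_longest(*result_lists):
        for hit in tier:
            if hit is None or hit["url"] in seen:
                continue
            seen.add(hit["url"])
            merged.append(hit)
    return merged


def vertical_search(query: str, sites: List[str], search: SearchFn) -> List[Result]:
    """Issue one single-site query per site and merge the results."""
    return merge_round_robin(search(site_query(query, s)) for s in sites)


def fake_search(q: str) -> List[Result]:
    """Stand-in for a real web-service call; returns two canned hits per site."""
    site = q.rsplit("site:", 1)[1]
    return [{"url": f"http://{site}/{i}", "title": f"{site} hit {i}"} for i in range(2)]


if __name__ == "__main__":
    for hit in vertical_search("swine flu", ["a.example", "b.example"], fake_search):
        print(hit["url"])
```

The catch is cost: with a few thousand sites, every user query becomes a few thousand web-service calls, and round-robin merging discards whatever cross-site relevance ranking the engine would have provided. That is why the lack of native multi-site support matters for vertical search.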

Currently there are no query limits on BOSS, but with rapidly increasing usage and adoption this is bound to change. The monetization options in both the Yahoo and Google open search platforms are fairly limited and revolve around advertising. For small enterprises and start-ups, using these platforms is therefore cumbersome, especially if the value proposition is being built around a vertical search model.

On a related note, Yahoo’s partnership with Hakia comes at an interesting time, with Microsoft completing the Powerset acquisition. The semantic search war is picking up steam!