Search COVID-19 Open Research Dataset (CORD-19) using Vespa - Open Source Big Data Serving Engine
<p><a href="https://www.linkedin.com/in/kraune/">Kristian Aune</a>, Tech Product Manager, Verizon Media<br/></p><p><b></b></p><p>After being made aware of the <a href="https://www.kaggle.com/allen-institute-for-ai/CORD-19-research-challenge">COVID-19 Open Research Dataset Challenge (CORD-19)</a>, where AI experts have been asked to create text and data mining tools that can help the medical community, the Vespa team wanted to contribute. </p><p><b></b></p><p>Given our experience with big data at Yahoo (now Verizon Media) and creating <a href="https://vespa.ai/">Vespa</a> (open source big data serving engine), we thought the best way to help was to index the dataset, which includes over 44,000 scholarly articles, and to make it available for searching via Vespa Cloud.</p><p><b></b></p><p><b>Now live at <a href="https://cord19.vespa.ai/">https://cord19.vespa.ai</a></b>, you can get started with a few of the sample queries or for more advanced queries, visit <a href="https://github.com/vespa-engine/cord-19/blob/master/cord-19-queries.md">CORD-19 API Query</a>. Feel free to tweet us <a href="http://www.twitter.com/vespaengine">@vespaengine</a> or <a href="https://github.com/vespa-engine/cord-19/issues">submit an issue</a>, if you have any questions or suggestions.</p><p><b></b></p><p>Please expect daily updates to the documentation and query features. Contributions are appreciated - please refer to our <a href="https://github.com/vespa-engine/cord-19/blob/master/CONTRIBUTING.md">contributing</a> guide and submit PRs. You can also download the application, index the data set, and improve the service. <a href="https://github.com/vespa-engine/cord-19/blob/master/experiment-yourself.md">More info here</a> on how to run Vespa.ai on your own computer. </p>