
{"id":109716,"date":"2025-11-02T05:48:39","date_gmt":"2025-11-02T05:48:39","guid":{"rendered":"https:\/\/mycryptomania.com\/?p=109716"},"modified":"2025-11-02T05:48:39","modified_gmt":"2025-11-02T05:48:39","slug":"taumode-a-new-way-of-searching-vector-databases","status":"publish","type":"post","link":"https:\/\/mycryptomania.com\/?p=109716","title":{"rendered":"taumode: a new way of searching vector databases"},"content":{"rendered":"<p>Here it is people: <strong>test proving taumode<\/strong>\u00a0\ud83c\udf44\ud83c\udf44\ud83c\udf44<\/p>\n<p>Since the publication of <a href=\"https:\/\/joss.theoj.org\/papers\/10.21105\/joss.09002\">my latest paper<\/a> I have received suggestions about testing my ideas on a real dataset. Here it\u00a0is!<\/p>\n<p>A complete unroll of the CVE dataset from 1999 to 2025 to:<br \/>\u2699\ufe0f build a fine-tuned embedder on domain-specific text <br \/>\ud83e\udde9 generate a taumode index for the embeddings<br \/>\u2754\u2754\u2754 query the index<br \/>\ud83e\uddee check the quality of the results against cosine similarity<\/p>\n<p>The results demonstrate that we can search better than current vector databases do\u00a0\ud83d\udc47\ud83d\udc47\ud83d\udc47<\/p>\n<p><a href=\"https:\/\/www.tuned.org.uk\/posts\/008_arrowspace_proof_of_concept_energy_informed_search\">008_arrowspace_proof_of_concept_energy_informed_search &#8211; AI Research Engineering<\/a><\/p>\n<p>Link to code available in the blog\u00a0post.<\/p>\n<p>\u200b<br \/>Please consider sponsoring me on Github -&gt; <a href=\"https:\/\/github.com\/sponsors\/Mec-iS\">https:\/\/github.com\/sponsors\/Mec-iS<\/a><\/p>\n<p><a href=\"https:\/\/www.linkedin.com\/search\/results\/all\/?keywords=%23vectordb&amp;origin=HASH_TAG_FROM_FEED\">#vectorDB<\/a> <a href=\"https:\/\/www.linkedin.com\/search\/results\/all\/?keywords=%23embeddings&amp;origin=HASH_TAG_FROM_FEED\">#embeddings<\/a> <a href=\"https:\/\/www.linkedin.com\/search\/results\/all\/?keywords=%23search&amp;origin=HASH_TAG_FROM_FEED\">#search<\/a> <a href=\"https:\/\/www.linkedin.com\/search\/results\/all\/?keywords=%23ranking&amp;origin=HASH_TAG_FROM_FEED\">#ranking<\/a> <a href=\"https:\/\/www.linkedin.com\/search\/results\/all\/?keywords=%23matching&amp;origin=HASH_TAG_FROM_FEED\">#matching<\/a><\/p>\n<p><a href=\"https:\/\/medium.com\/coinmonks\/taumode-a-new-way-of-searching-vector-databases-eea79973ecbf\">taumode: a new way of searching vector databases<\/a> was originally published in <a href=\"https:\/\/medium.com\/coinmonks\">Coinmonks<\/a> on Medium, where people are continuing the conversation by highlighting and responding to this story.<\/p>","protected":false},"excerpt":{"rendered":"<p>Here it is people: test proving taumode\u00a0\ud83c\udf44\ud83c\udf44\ud83c\udf44 Since the publication of my latest paper I have received suggestions about testing my ideas on a real dataset. Here it\u00a0is! A complete unroll of the CVE dataset from 1999 to 2025 to:\u2699\ufe0f build a fine-tuned embedder on domain-specific text \ud83e\udde9 generate a taumode index for the embeddings\u2754\u2754\u2754 [&hellip;]<\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2],"tags":[],"class_list":["post-109716","post","type-post","status-publish","format-standard","hentry","category-interesting"],"_links":{"self":[{"href":"https:\/\/mycryptomania.com\/index.php?rest_route=\/wp\/v2\/posts\/109716"}],"collection":[{"href":"https:\/\/mycryptomania.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mycryptomania.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/mycryptomania.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=109716"}],"version-history":[{"count":0,"href":"https:\/\/mycryptomania.com\/index.php?rest_route=\/wp\/v2\/posts\/109716\/revisions"}],"wp:attachment":[{"href":"https:\/\/mycryptomania.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=109716"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mycryptomania.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=109716"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mycryptomania.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=109716"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}