.It may have been August but there was no noticeable diminution of the rate of publication of research papers on arXiv relating to the enterprise applications of AI, with a particular focus on search applications. This is far from a comprehensive listing and is inevitably biased towards my own areas of research interest. This month I have listed the search research under a specific heading. I would be the first to acknowledge that the distinction is very blurred!
Enterprise-wide applications
To Classify is to Interpret: Building Taxonomies from Heterogeneous Data through Human-AI Collaboration https://arxiv.org/abs/2307.16481
Lessons in Reproducibility: Insights from NLP Studies in Materials Science https://arxiv.org/abs/2307.15759
Three Bricks to Consolidate Watermarks for Large Language Models https://arxiv.org/abs/2308.00113
AI Literature Review Suite https://arxiv.org/abs/2308.02443
Should we trust web scraped data? https://arxiv.org/abs/2308.02231
What has ChatGPT read? The origins of archaeological citations used by a generative artificial intelligence application https://arxiv.org/abs/2308.03301
Large Language Model Prompt Chaining for Long Legal Document Classification https://arxiv.org/abs/2308.04138
NLLG Quarterly arXiv Report 06/23: What are the most influential current AI Papers? https://arxiv.org/abs/2308.04889
Through the Lens of Core Competency: Survey on Evaluation of Large Language Models https://arxiv.org/abs/2308.07902
GPTEval: A Survey on Assessments of ChatGPT and GPT-4 https://arxiv.org/abs/2308.12488
Beyond Document Page Classification: Design, Datasets, and Challenges https://arxiv.org/abs/2308.12896
Search and information retrieval
On the Effects of Regional Spelling Conventions in Retrieval Models https://arxiv.org/abs/2308.00480
Generative Query Reformulation for Effective Adhoc Search https://arxiv.org/abs/2308.00415
Evaluation of Conversational Agents for Aerospace Domain https://ceur-ws.org/Vol-2621/CIRCLE20_21.pdf
Large Language Models for Information Retrieval: A Survey https://arxiv.org/abs/2308.07107
Informed Named Entity Recognition Decoding for Generative Language Models https://arxiv.org/abs/2308.07791
Improving Neural Ranking Models with Traditional IR Methods https://arxiv.org/abs/2308.15027
Vector Search with OpenAI Embeddings: Lucene Is All You Need https://arxiv.org/abs/2308.14963
Is ChatGPT a Biomedical Expert? Exploring the Zero-Shot Performance of Current GPT Models in Biomedical Tasks https://arxiv.org/pdf/2306.16108.pdf
Martin White 19 September 2023