Research on enterprise AI August 2023

.It may have been August but there was no noticeable diminution of the rate of publication of research papers on arXiv relating to the enterprise applications of AI, with a particular focus on search applications.  This is far from a comprehensive listing and is inevitably biased towards my own areas of research interest. This month I have listed the search research under a specific heading. I would be the first to acknowledge that the distinction is very blurred!

Enterprise-wide applications 

To Classify is to Interpret: Building Taxonomies from Heterogeneous Data through Human-AI Collaboration  https://arxiv.org/abs/2307.16481

Lessons in Reproducibility: Insights from NLP Studies in Materials Science https://arxiv.org/abs/2307.15759

Three Bricks to Consolidate Watermarks for Large Language Models https://arxiv.org/abs/2308.00113

AI Literature Review Suite https://arxiv.org/abs/2308.02443

Should we trust web scraped data? https://arxiv.org/abs/2308.02231

What has ChatGPT read? The origins of archaeological citations used by a generative artificial intelligence application https://arxiv.org/abs/2308.03301

Large Language Model Prompt Chaining for Long Legal Document Classification https://arxiv.org/abs/2308.04138

NLLG Quarterly arXiv Report 06/23: What are the most influential current AI Papers? https://arxiv.org/abs/2308.04889

Through the Lens of Core Competency: Survey on Evaluation of Large Language Models https://arxiv.org/abs/2308.07902

GPTEval: A Survey on Assessments of ChatGPT and GPT-4 https://arxiv.org/abs/2308.12488

Beyond Document Page Classification: Design, Datasets, and Challenges https://arxiv.org/abs/2308.12896

Search and information retrieval

On the Effects of Regional Spelling Conventions in Retrieval Models https://arxiv.org/abs/2308.00480

Generative Query Reformulation for Effective Adhoc Search https://arxiv.org/abs/2308.00415

Evaluation of Conversational Agents for Aerospace Domain https://ceur-ws.org/Vol-2621/CIRCLE20_21.pdf

Large Language Models for Information Retrieval: A Survey https://arxiv.org/abs/2308.07107

Informed Named Entity Recognition Decoding for Generative Language Models https://arxiv.org/abs/2308.07791

Improving Neural Ranking Models with Traditional IR Methods https://arxiv.org/abs/2308.15027

Vector Search with OpenAI Embeddings: Lucene Is All You Need https://arxiv.org/abs/2308.14963

Is ChatGPT a Biomedical Expert? Exploring the Zero-Shot Performance of Current GPT Models in Biomedical Tasks https://arxiv.org/pdf/2306.16108.pdf

Martin White 19 September 2023