One-off tests don’t measure AI’s true impact. We’re better off shifting to more human-centered, context-specific methods.
Abstract: Recent advances in large language models (LLMs) have enabled promising performance in unit test generation through in-context learning (ICL). However, the quality of in-context examples ...