Generative Engine Optimization (GEO) and How to Optimize for AI Search Results [Princeton Study]

Yesterday I found this study about how it might be possible to optimize against LLM-based search engines (think Google Bard, Google SGE, Bing, ChatGPT, etc), AI Search or Generative Engines as the study calls them.

They coin this new approach: Generative Engine Optimization (GEO).

The research, conducted by teams from Princeton University, Georgia Tech, Allen Institute for AI, and IIT Delhi, evaluates which strategies that are effective in order to improve visibility in these types of search engines.

And this is interesting, as traditional SEO methods may not yield the same results with these new-age generative engines, which provide direct, comprehensive responses and could potentially decrease organic traffic to websites.

Therefore, SEO professionals must understand and adapt to this new paradigm as we will see more and more of this type of AI in Google search engine in 2024 and beyond.

By leveraging Generative Engine Optimization (GEO) methods, such as including citations, quotations from relevant sources, and statistics, SEOs can significantly boost a website's visibility in AI search results according to the study.

I previously discussed this a bit when Google Bard was introduced, explaining how its optimization could resemble strategies used for enhancing featured snippets.

The main findings of the study were:

Focus on Impressions Metrics: Traditional metrics used in search engine optimization (SEO) are no longer sufficient for generative engines. Instead, GEO proposes a set of impression metrics that measure the visibility of citations and their relevance to the user query.
Include including citations and quotations: The study evaluates various GEO methods and their effectiveness in improving source visibility. Notably, methods such as including citations, quotations from relevant sources, and statistics significantly boost source visibility by up to 40% in generative engine responses.
Domain-Specific Optimization: The study demonstrates the importance of domain-specific optimization strategies. Different GEO methods perform better in certain domains, highlighting the need for targeted adjustments to enhance visibility. Like authoritative language worked best for improving historical content, citation optimization benefits factual queries, and statistics enhance law and government topics.

If you want the more meaty details, I have tried to outline them in the following.

What is AI Search or generative engines and why does it matter?

AI Search or Large Language Models (LLMs) represent the next generation of search engine technology.

These advanced systems, such as BingChat, Google's SGE, Perplexity or to degree ChatGPT, merge the capabilities of traditional search engines with the adaptability of generative models.

These new types of search engine, known as generative engines (GE), go beyond simply searching for information.

They generate multi-modal responses by synthesizing information from multiple sources.

Generative engines work by retrieving relevant documents from a database, such as the internet, and using large neural models to generate a response.

This response is grounded in the sources, ensuring attribution and providing a way for the user to verify the information.

While these engines offer significant advantages to both developers and users, they also pose challenges for website and content creators.

Unlike traditional search engines, generative engines can provide direct, comprehensive responses, which could lead to a decrease in organic traffic to websites and impact their visibility.

How the study compares classic search engines with generative engines (AI Search)

But, again Google have done this for quite some time with their more and more frequent featured snippet.

But of course GE or SGE takes this to a whole new level.

And as these engines continue to evolve, they will undoubtedly play a pivotal role in shaping the future of search technology.

The study: “GEO: Generative Engine Optimization”

In the study simply named “GEO: Generative Engine Optimization”, the researchers conducted experiments to evaluate the effectiveness of different Generative Engine Optimization (GEO) methods.

They used a benchmark called GEO-BENCH, which consisted of 10,000 diverse queries from different sources and domains.

Dataset	Description
MS Macro	Contains real anonymized user queries from Bing and Google Search Engines.
ORCAS-1	Contains real anonymized user queries from Bing and Google Search Engines.
Natural Questions	Contains real anonymized user queries from Bing and Google Search Engines.
AllSouls	Contains essay questions from "All Souls College, Oxford University". Requires Generative Engines to perform appropriate reasoning to aggregate information from multiple sources.
LIMA	Contains challenging questions requiring Generative Engines to not only aggregate information but also perform suitable reasoning to answer the question (e.g., writing a short poem, python code.).
Davinci-Debtate	Contains debate questions generated for testing Generative Engines.
Perplexity.ai Discover	Queries are sourced from Perplexity.ai’s Discover section, which is an updated list of trending queries on the platform.
ELI-5	Contains questions from the ELI5 subreddit, where users ask complex questions and expect answers in simple, layman’s terms.
GPT-4 Generated Queries	To supplement diversity in query distribution, GPT-4 is prompted to generate queries ranging from various domains (e.g., science, history) and based on query intent (e.g., navigational, transactional) and based on difficulty and scope of generated response (e.g., open-ended, fact-based).

‍

For each query in the benchmark, the researchers randomly selected a source website and applied one of the GEO methods separately to optimize the content of that source. They generated multiple answers per query to ensure statistical reliability.

The performance of the GEO methods was evaluated using two metrics: Position-Adjusted Word Count and Subjective Impression.

The Position-Adjusted Word Count metric considered the word count and position of the citation in the generative engine's response.

The Subjective Impression metric incorporated multiple subjective factors to compute an overall impression score.

The relative improvement in impression for each source was calculated by comparing the impression scores of the optimized response to the baseline response without any optimization.

Additionally, the researchers analyzed the performance of the GEO methods across different categories and domains.

They identified the top-performing categories for each method, indicating the specific contexts in which each method was most effective.

The study used the Perplexity.ai search engine and an AI search engine modeled on Bing Chat. The researchers found that the results were similar across both platforms.

They evaluated the GEO methods on a subset of 200 samples from the test set to assess their performance in a real-world generative engine scenario.

The nine different optimization techniques evaluated

The researchers evaluated nine different GEO methods to optimize website content for generative engines.

To me it seems like these methods is a mix of classic SEO techniques (think keyword-usage, E-E-A-T, semantic richness, external links etc)

These 9 methods were:

Authoritative: Modifies the text style of the source content to be more persuasive and authoritative, making claims with confidence.
Keyword Stuffing: Modifies content to include more keywords from the user query, similar to traditional SEO optimization strategies.
Statistics Addition: Modifies content to include quantitative statistics instead of qualitative discussion wherever possible, adding data-driven evidence.
Cite Sources: Adds relevant citations from credible sources to support claims and provide attribution throughout the website content.
Quotation Addition: Incorporates quotations from relevant sources to enhance the authenticity and depth of the website content.
Easy-to-Understand: Simplifies the language and structure of the website content, making it more accessible and appealing to the generative engine and users.
Fluency Optimization: Improves the fluency and readability of the website text, ensuring a smooth and coherent reading experience.
Unique Words: Adds unique and intriguing vocabulary to the website content, making it stand out and increasing its appeal.
Technical Terms: Incorporates technical terms and jargon relevant to the domain or industry, demonstrating expertise and catering to specific audiences.

Example of how the researches implemented GEO

The most effective GEO methods

They found that some methods were more effective in certain domains, while three strategies proved successful across all types of sites.

These top three strategies were Cite Sources, Quotation Addition, and Statistics Addition. These methods, requiring minimal changes to the actual content, improved the website's visibility by 30-40% compared to the baselines.

Interestingly, the researchers found that the effectiveness of optimization strategies varied depending on the knowledge domain.

For instance, the "Authoritative" optimization, which uses more persuasive language, worked best for content related to the Historical domain.

Meanwhile, the Citation optimization was most effective for factual search queries, and adding statistics proved beneficial for Law and government-related questions.

The research also revealed that some strategies were less effective than anticipated.

Using persuasive and authoritative tones in the content did not generally improve rankings in AI search engines.

Similarly, adding more keywords from the search query into the content (what we in classic SEO know as keyword stuffing if overdone), was not effective and performed worse than the baseline by 10%.

Are we to trust the study? I am not so sure

The researchers state that websites that are traditionally lower-ranked in SERP could significantly improve their visibility using GEO methods.

For instance, that the Cite Sources method led to a substantial 115.1% increase in visibility for websites ranked fifth in SERP.

This to me seems a bit random or too massive a change, which leads me to suspect that their methods are not 100% bulletproof.

Also they label “lower-ranked websites” as someone in fifth place in the SERPs. And that “many of these lower-ranked websites are often created by small content creators or independent businesses”.

To me, this misrepresents a misunderstanding when it comes to SEO. Lower-ranked websites are out of the top 10 for most queries, as today it requires a lot just to get to the first page. And most often first pages do not have small content creators in the first page.

Still, I find their study interesting, and I hope we see more of this as we move into the new era of GE or SGE.

While the study suggests that Generative Engine Optimization (GEO) could level the playing field for small content creators and independent businesses, there's a contrasting viewpoint that AI search or generative engines might instead favor larger, more credible websites.

This could potentially widen the gap between these entities and smaller businesses in the digital space.

Want to try the #1 AI Toolkit for SEO teams?

Our AI SEO assistants helps write and optimize everything - from descriptions and articles to product feeds - so they appeal to both customers and search engine algorithms. Try it now with a free trial→

Generative Engine Optimization (GEO) and How to Optimize for AI Search Results [Princeton Study]

This is an article written by:

Daniel Højris Bæk

Co-founder, SEO.ai

+20 years of experience from various digital agencies. Passionate about AI (artificial intelligence) and the superpowers it can unlock. I had my first experience with SEO back in 2001, working at a Danish web agency.

» See all articles

» Linkedin

» X (Twitter)

Join +75.000 others for monthly insights on SEO and artificial intelligence. Crafted by industry experts.

Generative Engine Optimization (GEO) and How to Optimize for AI Search Results [Princeton Study]

What is AI Search or generative engines and why does it matter?

The study: “GEO: Generative Engine Optimization”

The nine different optimization techniques evaluated

The most effective GEO methods

Are we to trust the study? I am not so sure

Want to try the #1 AI Toolkit for SEO teams?

Latest articles

14 Advertising Optimizers for Google [That Don't Suck]

How to Optimize Google Shopping Ads in 2025 [And 7 Tips]

How to Do Google Ads for Ecommerce in 2025 [All the Info You Need]

Other related articles

Odoo vs Linnworks: Which is the Best for Your Online Shop in 2025?

2024's SEO Conferences & Events

What Are Custom Labels and How Can They Help Segment My Feed for Better Bidding or Reporting?

Should You Outsource SEO? (2025)

SEO for Construction Companies: 8 Easy Tips to Drive More Local Leads

ChatGPT for SEO examples and use cases. 13 examples of application

Free SEO tools

Ranking SEO Check

Keyword Rank Checker

Meta Description Generator

SEO Analyzer

Broken Link Checker

Dead Link Checker

Convert Google Sheet to HTML Table Tool

AI Product Description Generator