Browse the best AI prompts for Gemini for Data Analysis. All tested, copy-paste ready, and free to use.
The best copy-paste AI prompts to complete your Gemini for Data Analysis from start to finish.
Browse the best AI prompts for Gemini for Data Analysis. All tested, copy-paste ready, and free to use.
The best free AI prompts for Gemini for Data Analysis, organized by stage. This guide covers understand and prepare the data, run the analysis, interpret the results, and more, with copy-paste ready prompts for every skill level. Pick your stage, copy a prompt, and get results right away.
Stage 1
Before analysis, you need to understand what data you have and whether it is ready to use. These prompts help you assess and clean your dataset.
Describe and assess a dataset
Here is a description of my dataset: [DESCRIBE COLUMNS, ROW COUNT, SOURCE]. Help me assess: what analyses are possible with this data, what it cannot tell me, any obvious quality issues I should fix before analysis, and what additional data would significantly improve what I can learn.
Write data cleaning steps
I have a dataset with these quality issues: [DESCRIBE ISSUES, e.g. missing values in X column, duplicate rows, inconsistent date formats]. Write the specific cleaning steps and code [IN PYTHON/SQL/SHEETS FORMULA] to fix each issue. Explain the trade-offs for each approach, especially how to handle missing values.
Write Google Sheets formulas for data prep
I have a Google Sheet with this structure: [DESCRIBE COLUMNS AND LAYOUT]. I need to: [DESCRIBE WHAT YOU WANT TO DO, e.g. calculate rolling averages, flag duplicates, join two sheets on a key column]. Write the specific formulas, where to put them, and how they work.
Write a SQL query for data extraction
Write a SQL query to extract [DESCRIBE WHAT YOU NEED] from this database schema: [DESCRIBE TABLES AND COLUMNS]. The query should: [LIST REQUIREMENTS, e.g. filter by date range, aggregate by category, join on customer_id]. Optimize for readability and include comments explaining the logic.
Identify data quality issues
Read this sample of my dataset: [PASTE SAMPLE OR DESCRIBE STRUCTURE]. Identify any data quality issues: outliers that might be errors, impossible values, inconsistent formatting, referential integrity problems, or missing data patterns. For each issue, tell me how to investigate it and how to handle it in analysis.
Stage 2
These prompts help you run the right analysis for your question, whether in Python, SQL, or Google Sheets.
Choose the right analysis method
I want to answer this business question: [DESCRIBE QUESTION] using this data: [DESCRIBE DATA]. What is the most appropriate analysis method? Give me two or three options with their trade-offs. For the best option, describe what output it produces and what I need to interpret it correctly.
Write a Python analysis script
Write a Python script to analyze [DESCRIBE DATASET AND QUESTION]. Use pandas and appropriate visualization libraries. The script should: load the data from [FILE FORMAT], perform [SPECIFIC ANALYSES], produce [SPECIFIC OUTPUTS], and save results to [OUTPUT FORMAT]. Include error handling and comments.
Analyze data in Google Sheets
I have data in Google Sheets: [DESCRIBE STRUCTURE]. I want to: [DESCRIBE ANALYSIS]. Walk me through doing this analysis entirely in Sheets, using built-in functions or a Gemini-assisted approach. Provide the exact formulas or steps, and tell me how to check that the results are correct.
Write a BigQuery analysis
Write a BigQuery SQL analysis for [DESCRIBE BUSINESS QUESTION] using this table structure: [DESCRIBE SCHEMA]. Include: the main analysis query, any intermediate CTEs needed for clarity, and a brief interpretation of what each step produces. Optimize for cost by minimizing data scanned.
Run a cohort or segmentation analysis
I want to run a [COHORT/SEGMENTATION/FUNNEL] analysis on this data: [DESCRIBE DATA]. Write the analysis code in [PYTHON/SQL] that produces: [DESCRIBE OUTPUTS, e.g. retention by cohort, conversion by segment, drop-off by funnel stage]. Include the visualization code.
Stage 3
Numbers do not speak for themselves. These prompts help you move from output to insight.
Interpret analysis results
Here are the results of my analysis: [PASTE RESULTS OR DESCRIBE]. The business question I was trying to answer: [DESCRIBE]. Interpret these results: what do they actually mean in plain language, what is the most important finding, what limitations of the analysis should I flag, and what does this imply I should do?
Identify the key drivers
I have data showing [METRIC] changed by [AMOUNT] over [PERIOD]. Here is the data: [PASTE OR DESCRIBE]. Help me identify the key drivers of this change. What analysis would decompose this change into contributing factors? Write the analysis code and interpret the output.
Spot anomalies and outliers
Read this data and identify any anomalies, outliers, or unexpected patterns: [PASTE DATA OR RESULTS]. For each one, tell me: what makes it unusual, whether it is more likely to be a real finding or a data error, and what I should investigate to understand it.
Validate an A/B test result
I ran an A/B test with these results: [DESCRIBE RESULTS, sample sizes, conversion rates or metric values for control and variant]. Help me: calculate statistical significance, determine if the result is practically meaningful given sample size, identify any threats to validity in my test design, and decide whether to ship the variant.
Draw business conclusions
Here are my data findings: [DESCRIBE FINDINGS]. My stakeholder audience is [DESCRIBE]. What are the three most important business implications of these findings? What actions should we take, what should we investigate further, and what should we stop doing? Be direct and specific.
Stage 4
Analysis that does not communicate clearly does not drive decisions. These prompts help you build reports and presentations from your findings.
Write an analysis summary
Write a data analysis summary for [AUDIENCE, e.g. executive team / product team / investors]. Key findings: [LIST]. Include: the business question, the key metrics, the most important finding, the recommended action, and the key caveats. Under 300 words.
Design a Looker Studio dashboard
I want to build a Looker Studio dashboard tracking [BUSINESS AREA] for [AUDIENCE]. Based on my data structure: [DESCRIBE], recommend: the five most important metrics to track, the best visualization type for each, how to arrange the layout for a daily business review, and what filters to include.
Write data-driven talking points
I am presenting these data findings: [DESCRIBE] to [AUDIENCE]. Write five data-driven talking points I can use in the presentation. Each should: cite a specific number, explain what it means in context, and connect to a business implication. Write them as spoken sentences, not bullet points.
Create a data story structure
I have these data findings: [DESCRIBE]. Help me structure them as a data story for a presentation. Use the Problem-Evidence-Insight-Action structure: (1) the business problem that motivated the analysis, (2) the key evidence from the data, (3) the insight that emerges, (4) the recommended action. Write the narrative arc.
Document the analysis for reproducibility
Write documentation for this analysis so another analyst can reproduce it: [DESCRIBE ANALYSIS AND CODE]. Include: the business question, the data sources and access details, the processing steps, the analysis methodology, the interpretation logic, and known limitations. Format as a shareable analysis README.
In Google Sheets, you can use Gemini in the side panel to ask questions about your data, generate formulas, and create charts. In Gemini Advanced, you can upload a sheet as a file and ask analytical questions directly. For large-scale analysis, the Gemini API connects to BigQuery for SQL-based data work.
Yes. Gemini writes pandas, NumPy, matplotlib, and scikit-learn code well. Provide the data structure clearly (describe column names and types) and specify the exact output you want. For Google Cloud data, Gemini has strong knowledge of BigQuery client libraries and Vertex AI.
Gemini for Google Cloud integrates directly into the BigQuery console and can generate, explain, and optimize SQL queries. You can also use Gemini Advanced to write BigQuery queries by describing your schema and analysis goal. Specify "BigQuery SQL syntax" to get compatible query syntax.
Gemini handles descriptive statistics, regression, A/B testing, cohort analysis, and common machine learning workflows well. For specialized statistical methods, verify that it is using the correct approach for your data distribution and sample size. Always review the reasoning behind the method choice, not just the code output.
Yes. Gemini can recommend dashboard layouts, suggest metrics and visualizations, and help you write calculated fields in Looker Studio. Describe your data source and the audience for the dashboard, and ask for a layout recommendation with the specific chart types for each metric.
AI Prompts for Gemini for Research
Gemini is one of the strongest research tools available because it combines a large knowledge base with Google Search grounding, Deep Research mode, and the ability to synthesize across multiple documents simultaneously.
See promptsAI Prompts for Gemini for Coding
Gemini is a strong coding assistant with particularly deep integration into Google's development ecosystem: Firebase, Google Cloud, Android Studio, and Google APIs.
See prompts