<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Gemini on Commentary of Takao</title><link>https://takao.blog/en/tags/gemini/</link><description>Recent content in Gemini on Commentary of Takao</description><generator>Hugo -- gohugo.io</generator><language>en</language><copyright>Commentary of Takao</copyright><lastBuildDate>Sat, 13 Jun 2026 23:11:50 +0900</lastBuildDate><atom:link href="https://takao.blog/en/tags/gemini/index.xml" rel="self" type="application/rss+xml"/><item><title>Building Apps using Gemini 1.5 Pro's Massive Context Length</title><link>https://takao.blog/en/web/gemini-api-pro-latest-utilization/</link><pubDate>Thu, 05 Mar 2026 00:00:00 +0900</pubDate><guid>https://takao.blog/en/web/gemini-api-pro-latest-utilization/</guid><description>&lt;img src="https://takao.blog/img/thumnail.webp" alt="Featured image of post Building Apps using Gemini 1.5 Pro's Massive Context Length" /&gt;&lt;h2 id="the-context-window-revolution"&gt;The Context Window Revolution
&lt;/h2&gt;&lt;p&gt;Gemini 1.5 Pro redefined what&amp;rsquo;s possible with large language models by offering a context window of up to &lt;strong&gt;2 million tokens&lt;/strong&gt;. This means you can pass entire codebases, hours of video, or thousands of pages of documents in a single request — fundamentally changing how we interact with AI.&lt;/p&gt;
&lt;h2 id="understanding-gemini-15-pros-capabilities"&gt;Understanding Gemini 1.5 Pro&amp;rsquo;s Capabilities
&lt;/h2&gt;&lt;table&gt;
	&lt;thead&gt;
			&lt;tr&gt;
					&lt;th&gt;Feature&lt;/th&gt;
					&lt;th&gt;Capability&lt;/th&gt;
			&lt;/tr&gt;
	&lt;/thead&gt;
	&lt;tbody&gt;
			&lt;tr&gt;
					&lt;td&gt;Context window&lt;/td&gt;
					&lt;td&gt;Up to 2M tokens (1M standard)&lt;/td&gt;
			&lt;/tr&gt;
			&lt;tr&gt;
					&lt;td&gt;Input modalities&lt;/td&gt;
					&lt;td&gt;Text, image, audio, video, code&lt;/td&gt;
			&lt;/tr&gt;
			&lt;tr&gt;
					&lt;td&gt;Output&lt;/td&gt;
					&lt;td&gt;Text, code, structured data&lt;/td&gt;
			&lt;/tr&gt;
			&lt;tr&gt;
					&lt;td&gt;Max output tokens&lt;/td&gt;
					&lt;td&gt;8,192&lt;/td&gt;
			&lt;/tr&gt;
			&lt;tr&gt;
					&lt;td&gt;Languages&lt;/td&gt;
					&lt;td&gt;100+ languages&lt;/td&gt;
			&lt;/tr&gt;
			&lt;tr&gt;
					&lt;td&gt;Pricing (input)&lt;/td&gt;
					&lt;td&gt;$1.25–$10.00 per 1M tokens&lt;/td&gt;
			&lt;/tr&gt;
			&lt;tr&gt;
					&lt;td&gt;Pricing (output)&lt;/td&gt;
					&lt;td&gt;$10.00–$40.00 per 1M tokens&lt;/td&gt;
			&lt;/tr&gt;
	&lt;/tbody&gt;
&lt;/table&gt;
&lt;h2 id="multimodal-input-handling"&gt;Multimodal Input Handling
&lt;/h2&gt;&lt;p&gt;Gemini 1.5 Pro natively processes multiple modalities in a single request. You can combine text, images, audio, and video seamlessly:&lt;/p&gt;</description></item></channel></rss>