There’s no denying that generative AI is becoming more capable. So capable, in fact, that many organizations are thinking it can replace their in-house teams when it comes to developing articles, brochures and other communications material. But how realistic are those expectations, given that there’s no real intelligence under the hood of any GenAI model?
We tested two of the most prominent GenAI chatbots, OpenAI’s ChatGPT and Google Gemini, to see if they can actually produce compelling content — and replace the judgment and expertise of human writers.
One advantage of writing a longer article is that you can trim it down and repurpose the content for other channels, like social media. So how well can GenAI handle this relatively straightforward copyediting task?
To find out, we uploaded into Gemini and ChatGPT a 1,500-word article on how a wireless networking technology could be used by a specific industry, then asked the chatbots to condense it to just 500 to 600 words. We purposely gave minimal instructions beyond the target length, wanting to see whether the chatbots could match a professional writer's judgment about what to keep and what to discard from the original draft. Here's what we found:
Winner: ChatGPT for getting closer to the mark in a single attempt. However, it still simply cut down the article while keeping the same subsections. Because a chatbot generates text probabilistically rather than with any kind of intelligence, it isn't capable of adapting or reinventing the article's structure to better accommodate the much shorter word count (as our team would do if given the same exercise), resulting in some ideas that weren't fully fleshed out.
We conduct a lot of interviews with subject-matter experts to get the information and messaging we need to write our content. Can a chatbot take our interview notes, isolate the most relevant details (separating the core conversation from the unnecessary sidebars) and then generate something that matches the style of an example article?
To test this, we uploaded our notes from an interview with a researcher as well as an article to use as a model. We then told the chatbots to generate a 300- to 350-word article explaining what the researcher is doing, why it matters, what’s innovative about the work and the impact it might have. Here are the results:
Winner: Gemini for more accurately presenting the information from our notes, even if its structure sounded less natural than ChatGPT's due to its strict adherence to our prompt. That said, both chatbots were missing the kind of storytelling a human writer can bring. For a more engaging read, our team would weave the elements of the prompt throughout the article instead of dealing with them in such a prescribed order.
Writing a blog post is one thing. What about copy that will go into design and layout, like a brochure promoting a service or solution? Can a chatbot replicate something with a pre-defined template and established sections for presenting features, benefits and so on?
We uploaded into both chatbots an example of how we present copy in Word for a two-page PDF brochure. We then uploaded multiple background documents, including a PowerPoint deck, to see if they could not only find the appropriate information but format it correctly as well.
Winner: ChatGPT for doing a better job replicating both the format and style of the sample. However, what was produced would not be considered client-ready, as it would still require a fair bit of polishing and editing to match the client’s tone of voice.
Finally, we wanted to get a sense of how well GenAI could copyedit existing content while adhering to the specific rules outlined in a client’s style guide.
To do so, we provided both Gemini and ChatGPT with a Word document containing draft content for a webpage, as well as a PDF version of the style guide. We instructed the chatbots to edit the content to make it easier to read for a general audience — more user-friendly, less technical — but without rewriting or restructuring the content in a significant way. Our prompt also specified which pages of the style guide were most important, reminding the chatbots to pay close attention to rules related to capitalization, punctuation, treatment of numbers and so on.
Winner: ChatGPT, but not for the quality of the edit, as the two chatbots were about equal in that regard (and still well behind the precision and consistency our editors bring to the task). Unlike Gemini, which only works within its chat window, ChatGPT can output its edits as a Word document, which is more useful for our needs. Unfortunately, it can't actually use Word's Track Changes functionality, instead indicating deletions and insertions through strikethroughs and bolding. That ultimately makes the output not very usable on our end, as we'd have to re-apply the edits manually, with the changes tracked, to produce a client-ready document.
Overall, ChatGPT seems to be better suited for generating marketing content than Google Gemini, although the gap between the two is not large.
So do we believe GenAI chatbots can replace human writers? No. That’s because while GenAI can simulate human intelligence remarkably well, it is just a simulation. In truth, GenAI models don’t do any “thinking” at all. Instead, they draw on enormous datasets to determine the most likely response to any given prompt, like your phone’s autocomplete functionality taken to the next level.
Ultimately, that means GenAI is incapable of producing anything truly new, making it a non-starter for thought leadership. It also makes it difficult for your content to stand out, as GenAI defaults to the same stock phrases and lacks both substance and storytelling. And then there's the issue of AI hallucination, where chatbots confidently produce plausible-sounding but incorrect information or outright false statements.
It’s also unclear how much time is actually saved by using GenAI. How much time will it take to test many different prompts and evaluate the outputs until you get something close to what you want? And then how long will it take to edit the output so it reflects your brand voice? Or to double-check that the chatbot accurately interpreted the information you fed into it, especially when dealing with highly technical or specialized topics?
When you add it all up, it’s better to hire a team like Ascribe to do the writing for you. Because when quality counts and your reputation is on the line, AI still isn’t worth the risk.
Contact us today to see what marketing content produced by real writers can do for your business.