Featured Article

Leverage public data to improve content marketing outcomes

Publishers prefer pitches that demonstrate accuracy and authoritativeness

Comment

Crowd walking over binary code
Image Credits: Orbon Alija (opens in a new window) / Getty Images

Kristin Tynski

Contributor

Kristin Tynski is co-founder and SVP creative at Fractl, a growth marketing agency that’s helped Fortune 500 companies and boutique businesses earn quality media coverage, backlinks, awareness and authority.

More posts from Kristin Tynski

Recently I’ve seen people mention the difficulty of generating content that can garner massive attention and links. They suggest that maybe it’s better to focus on content without such potential that can earn just a few links but do it more consistently and at higher volumes.

In some cases, this can be good advice. But I’d like to argue that it is very possible to create content that can consistently generate high volumes of high-authority links. I’ve found in practice there is one truly scalable way to build high-authority links, and it’s predicated on two tactics coming together:

  1. Creating newsworthy content that’s of interest to major online publishers (newspapers, major blogs or large niche publishers).
  2. Pitching publishers in a way that breaks through the noise of their inbox so that they see your content.

How can you use new techniques to generate consistent and predictable content marketing wins?

The key is data.

Techniques for generating press with data-focused stories

It’s my strong opinion that there’s no shortcut to earning press mentions and that only truly new, newsworthy and interesting content can be successful. Hands down, the simplest way to predictably achieve this is through a data journalism approach.

One of the best ways you can create press-earning, data-focused content is by using existing data sets to tell a story.

There are tens of thousands — perhaps hundreds of thousands — of existing public datasets that anyone can leverage for telling new and impactful data-focused stories that can easily garner massive press and high levels of authoritative links.

The last five years or so have seen huge transparency initiatives from the government, NGOs and public companies making their data more available and accessible.

Additionally, FOIA requests are very commonplace, freeing even more data and making it publicly available for journalistic investigation and storytelling.

Because this data usually comes from the government or another authoritative source, pitching these stories to publishers is often easier because you don’t face the same hurdles regarding proving accuracy and authoritativeness.

Potential roadblocks

The accessibility of data provided by the government especially can vary. There are little to no data standards in place, and each federal and local government office has varying amounts of resources in making the data they do have easy to consume for outside parties.

The result is that each dataset often has its own issues and complexities. Some are very straightforward and available in clean and well-documented CSVs or other standard formats.

Unfortunately, others are often difficult to decode, clean, validate or even download, sometimes being trapped inside of difficult to parse PDFs, fragmented reports or within antiquated querying search tools that spit out awkward tables.

Deeper knowledge of web scraping and programmatic data cleaning and reformatting are often required to be able to accurately acquire and utilize many datasets.

Tools to use

  • Google dataset search — Google provides perhaps the most comprehensive way to quickly find datasets with their recently released dataset search.
  • inurl:gov “dataset” — This search query will surface many of the large and important federal and state government datasets, but it’s by no means comprehensive. Still, I find myself using this search string frequently, adding additional keywords outside to narrow my topical focus.
  • Reddit.com/r/datasets — This is one of the largest dataset communities online, and it’s very active, with users posting datasets of all types including public datasets, rare finds and custom scraped data, as well as tools, tricks and tips for finding great data elsewhere online.
  • Data.world — This site aggregates datasets and provides tools to make them more accessible. Additionally they have a great email listserv that surfaces new and interesting datasets as they become available.
  • Data is plural — This is a listserv run by Jeremy Singer-Vine, the data editor at BuzzFeed. It’s a curated list of new and fascinating datasets.
  • Kaggle datasets — Kaggle is the largest data-science and machine learning competition platform online. Its massive community has published thousands of datasets, many of them in easy to consume formats that are pre-cleaned.
  • Data journalism GitHub repositories FiveThirtyEight and the NYTimes Upshot are just two examples of major outlets that make some or all of the data they use in data journalism available on GitHub. This trend is likely to continue, with more and more news publishers creating transparency with their data journalism by making the raw data publicly available.

Tip: Often the juiciest angles in existing datasets are found after becoming very familiar with what the dataset contains and how the data can be mixed/matched to find unique correlations and answer the unanswered questions.

Earn the best backlinks with high-quality content and digital PR

Winning examples of content executed using free data

Some datasets are so large and so comprehensive that there are literally hundreds or thousands of unique and highly newsworthy stories that can be told using their data.

Here are three examples of how you can use these types of datasets to make engaging content that garners high-quality links.

U.S. Census information

Census estimates are released every year on the census.gov website. When creating content for Porch.com, we decided to rank neighborhoods with the highest incomes and home values in each state and see what trends appeared in the names of those neighborhoods.

We did this analysis by gathering data from U.S. Census information for all Census Designated Places (CDPs). A CDP is a concentration of population used for statistical purposes only and is not legally incorporated. We then used QGIS to extract the data from the census and analyzed the data by ZIP code.

With this information, we created and pitched a project called Neighborhood Names.

Image Credits: Yahoo Finance (opens in a new window)

While massive datasets can seem dense and difficult to use, if you focus on particular data with a thesis in mind, you can create a straightforward, simple and popular campaign like this one that earns coverage on publications like CNBC and Realtor.com.

Takeaway: Consider questions you have that large datasets could potentially answer, and then explore those specific angles. Having a narrower focus can really help keep you and your readers from being overwhelmed by information.

U.S. Department of Labor Information

For a different project, we utilized readily available and free information from The U.S. Department of State — Bureau of Consular Affairs and U.S. Department of Labor to formulate a guide to applying, securing and making the most of an H-1B Visa.

We also used government data to examine how H-1B visa holders work in the U.S., how much money they earn and which companies they work for.

The objective was to create a resource by organizing government information in a digestible and interesting way through the creation of maps and by outlining employment opportunities.

The resulting project was featured with Patch.com and other local news sites.

Although this particular set of data applies to content within a very niche vertical, you can combine these government statistics with newsworthy and timely tidbits to attract national and local press.

Takeaway: While this data is already publicly available, consider who can benefit from getting a distilled, clear, digestible version of that data that applies directly to them. Most people aren’t rifling through extensive databases of information, so if you’re able to point out the crucial elements to your audience, you’re providing a value they’ll be very grateful for.

Stand out with data-focused content marketing

As the total volume of content produced online continues to grow exponentially, brands will be increasingly fighting for attention.

Fortunately, the same growth that increases competition also fuels the acceleration of available datasets and new tools for telling compelling stories with that data. Leveling up your content will require that you do more than simply rehash information that already exists and instead add something entirely and previously unknown to the world.

The good news? Those who see the opportunity in data-focused content will reap disproportionate benefits.

2 strategies for creating top-of-funnel marketing content

More TechCrunch

For years, Sammy Faycurry has been hearing from his dietician mom and sister about how poorly many Americans eat and their struggles with delivering nutritional counseling. Although nearly half of…

Dietitian startup Fay has been booming from Ozempic patients and emerges from stealth with $25M from General Catalyst, Forerunner

Apple is bringing new accessibility features to iPads and iPhones, designed to cater to a diverse range of user needs.

Apple announces new accessibility features for iPhone and iPad users

TechCrunch Disrupt, our flagship startup event held annually in San Francisco, is back on October 28-30 — and you can expect a bustling crowd of thousands of startup enthusiasts. Exciting…

Startup Blueprint: TC Disrupt 2024 Builders Stage agenda sneak peek!

Mike Krieger, one of the co-founders of Instagram and, more recently, the co-founder of personalized news app Artifact (which TechCrunch corporate parent Yahoo recently acquired), is joining Anthropic as the…

Anthropic hires Instagram co-founder as head of product

Seven orgs so far have signed on to standardize the way data is collected and shared.

Venture orgs form alliance to standardize data collection

As cloud adoption continues to surge toward the $1 trillion mark in annual spend, we’re seeing a wave of enterprise startups gaining traction with customers and investors for tools to…

Alkira connects with $100M for a solution that connects your clouds

Charging has long been the Achilles’ heel of electric vehicles. One startup thinks it has a better way for apartment dwelling EV drivers to charge overnight.

Orange Charger thinks a $750 outlet will solve EV charging for apartment dwellers

So did investors laugh them out of the room when they explained how they wanted to replace Quickbooks? Kind of.

Embedded accounting startup Layer secures $2.3M toward goal of replacing Quickbooks

While an increasing number of companies are investing in AI, many are struggling to get AI-powered projects into production — much less delivering meaningful ROI. The challenges are many. But…

Weka raises $140M as the AI boom bolsters data platforms

PayHOA, a previously bootstrapped Kentucky-based startup that offers software for self-managed homeowner associations (HOAs), is an example of how real-world problems can translate into opportunity. It just raised a $27.5…

Meet PayHOA, a profitable and once-bootstrapped SaaS startup that just landed a $27.5M Series A

Restaurant365, which offers a restaurant management suite, has raised a hot $175M from ICONIQ Growth, KKR and L Catterton.

Restaurant365 orders in $175M at $1B+ valuation to supersize its food service software stack 

Venture firm Shilling has launched a €50M fund to support growth-stage startups in its own portfolio and to invest in startups everywhere else. 

Portuguese VC firm Shilling launches €50M opportunity fund to back growth-stage startups

Chang She, previously the VP of engineering at Tubi and a Cloudera veteran, has years of experience building data tooling and infrastructure. But when She began working in the AI…

LanceDB, which counts Midjourney as a customer, is building databases for multimodal AI

Trawa simplifies energy purchasing and management for SMEs by leveraging an AI-powered platform and downstream data from customers. 

Berlin-based trawa raises €10M to use AI to make buying renewable energy easier for SMEs

Lydia is splitting itself into two apps — Lydia for P2P payments and Sumeria for those looking for a mobile-first bank account.

Lydia, the French payments app with 8 million users, launches mobile banking app Sumeria

Cargo ships docking at a commercial port incur costs called “disbursements” and “port call expenses.” This might be port dues, towage, and pilotage fees. It’s a complex patchwork and all…

Shipping logistics startup Harbor Lab raises $16M Series A led by Atomico

AWS has confirmed its European “sovereign cloud” will go live by the end of 2025, enabling greater data residency for the region.

AWS confirms will launch European ‘sovereign cloud’ in Germany by 2025, plans €7.8B investment over 15 years

Go Digit, an Indian insurance startup, has raised $141 million from investors including Goldman Sachs, ADIA, and Morgan Stanley as part of its IPO.

Indian insurance startup Go Digit raises $141M from anchor investors ahead of IPO

Peakbridge intends to invest in between 16 and 20 companies, investing around $10 million in each company. It has made eight investments so far.

Food VC Peakbridge has new $187M fund to transform future of food, like lab-made cocoa

For over six decades, the nonprofit has been active in the financial services sector.

Accion’s new $152.5M fund will back financial institutions serving small businesses globally

Meta’s newest social network, Threads, is starting its own fact-checking program after piggybacking on Instagram and Facebook’s network for a few months.

Threads finally starts its own fact-checking program

Looking Glass makes trippy-looking mixed-reality screens where things look 3D without the need of special glasses. Today it launches a pair of new displays, including a 16-inch mode that runs…

Looking Glass launches new 3D displays

OpenAI co-founder and chief scientist Ilya Sutskever has left the company. Replacing Sutskever is Jakub Pachocki, OpenAI’s director of research.

Ilya Sutskever, OpenAI co-founder and longtime chief scientist, departs

Intuitive Machines made history when it became the first private company to land a spacecraft on the moon, so it makes sense to adapt that tech for Mars.

Intuitive Machines wants to help NASA return samples from Mars

As Google revamps itself for the AI era, offering AI overviews within its search results, the company is introducing a new way to filter for just text-based links. With the…

Google adds ‘Web’ search filter for showing old-school text links as AI rolls out

Blue Origin’s New Shepard rocket will take a crew to suborbital space for the first time in nearly two years later this month, the company announced on Tuesday.  The NS-25…

Blue Origin to resume crewed New Shepard launches on May 19

This will enable developers to use the on-device model to power their own AI features.

Google is building its Gemini Nano AI model into Chrome on the desktop

It ran 110 minutes, but Google managed to reference AI a whopping 121 times during Google I/O 2024 (by its own count). CEO Sundar Pichai referenced the figure to wrap…

Google mentioned ‘AI’ 120+ times during its I/O keynote

Firebase Genkit is an open source framework that enables developers to quickly build AI into new and existing applications.

Google launches Firebase Genkit, a new open source framework for building AI-powered apps

In the coming months, Google says it will open up the Gemini Nano model to more developers.

Patreon and Grammarly are already experimenting with Gemini Nano, says Google