03/10/2026 | News release | Distributed by Public on 03/10/2026 15:04
The MTA has a vast library of data covering a wide range of topics including service performance, budget, capital assets, workforce, and more-all of which is valuable to the public. But it wasn't always this way. Before 2021, most MTA data was only released in non-open formats, such as PDFs, and through Freedom of Information Law (FOIL) requests (except for real-time vehicle arrival data feeds, which we have published in open format since 2012). Ever since Governor Kathy Hochul and New York State Legislature enacted the MTA Open Data Law in 2021, the MTA's open data program has grown rapidly. In fact, it's now an award-winning program that can serve as a model for other government agencies standing up their own open data programs.
Good open data programs improve government transparency and accountability by making data easier to understand and work with for riders, researchers, and curious customers. Equally important, open data is crucial for making data easier to work with internally. You can read more about the tangible, organizational benefits to better managed and more accessible data in this previous lessons learned blog post.
In this next lessons learned post, we offer a guide on how to launch an open data program in government, using best practices we learned in the first years of establishing our program. The MTA's open data program has come a long way in five years, and we hope this resource helps other government agencies across the United States follow suit!
2021 was a turning point for MTA open data. Before then, data was regularly shared publicly in PDF or static Excel files. We only had 21 published open data sets available to the public.
The passage of the MTA Open Data Law in 2021 officially mandated the MTA to publish its data in open, machine-readable formats. Since then, our offerings have increased, with over 230 open data assets available to the public today. And we've conquered some technically complex and high-profile topics too, like the operating budget (the first transit agency in the United States to do so!), subway rider origin-destination ridership, and traffic counts in the Manhattan congestion relief zone.
As mentioned above, the governor and state legislature passed the MTA Open Data Law in 2021, officially requiring the MTA to publish information in an open format. For the MTA, this mandate was the catalyst for establishing a formal program and freeing up the resources needed to get the program off the ground. The MTA's open data law contains language that is common across many open data policies and statutes. A few pieces of the legislation were particularly helpful in establishing how the program would work:
A mandated annual report to document progress is common in other open data statutes. While not included in our statutory requirements, we still publish an annual update on our program page, which we have found incredibly valuable in memorializing our success.
Identifying who needs to be part of the immediate team, and who needs to be engaged in the surrounding circle, is one of the first steps to starting a successful open data program. Below is a description of each contributor:
When identifying and preparing your first datasets for release, we recommend prioritizing easy wins, such as data that is already made public (but not open) in some form or data that is frequently requested through the FOIL process. For us, we focused primarily on data that was already made public on a regular basis via PDFs. For example, monthly performance metrics for MTA board and committee materials were only shared in PDFs, but the open data program allowed us to automate and share that data in an open format to supplement the monthly PDF releases.
A small but critical benefit of focusing open data efforts on data that is already public in some form is that it allows for progress in the absence of a formalized data governance policy. Getting your first datasets in a machine-readable format up on a portal will reveal the gaps in your technical processes. Being able to figure out the technical details and process, without simultaneously having the complicated conversations of whether the data is okay to release in the first place, is an approach that worked well for us.
The importance of engaging the open data community by building relationships with key stakeholders and advocates cannot be understated! These organizations will be your biggest champions while also holding your organization accountable in a way that sustains momentum. Advocates are important external validators, and if they can provide positive feedback on your first open data releases, it will help build momentum internally for your open data program.
If open data advocates don't exist in your space, you could take the initiative to generate excitement and communicate your work. To start, you could participate in or host public-facing events about open data, partner with an existing local civic tech group, or work with the press office to generate positive coverage on your first open data release.
You should not expect all advocates to have the desire to work directly with open datasets you publish; doing some work to invest in data visualizations tools (built off open data, of course!) will go a long way. Accompanying visualizations increase the visibility of your program and secure advocates who operate more in the policy and communications sphere. We put quite a bit of work into building our data visualization tool metrics.mta.info early on, which visualizes open datasets in a way that is easily digestible for people of all technical skill levels.
Finally, let data users connect directly with the person and/or team managing your program. A team inbox ([email protected]) has worked well for us.
By the end of year one, you should have a few key datasets published to an open data portal, proved the value of open data to your organizations, and built relationships with a few key stakeholders both internal and external to your agency.
After our first year of the program, we were still manually uploading every single open dataset to the open data portal. We desperately needed the expertise of a data engineer to scale the program in a sustainable way. But we needed to take those first steps of sharing data openly to know what exactly we needed. All that's to say, don't let a lack of technical skills inhibit you from seeing what you can accomplish. We were still able to publish a lot of datasets in our first year of the program - and now we can publish even more.
We hope this post has given some concrete ideas on where to start when it comes to spinning up an open data program at your agency or organization. And for those reading because you hope to see a public agency in your city publish more open data, we encourage you to approach your advocacy efforts with understanding and optimism as the organization builds its capacity and resources to change the status quo.
Lisa Mae Fiedler manages MTA's Open Data program. She is thrilled to celebrate NYC Open Data Week later this month-you can check out the calendar of events here.