The post NEAR Protocol BigQuery Public Dataset appeared first on NEAR Protocol.
Today, we are excited to announce the NEAR BigQuery Public Dataset for anyone who wants to query blockchain data in an easy and cost-effective way.
Until now, a user’s data query needs were fulfilled by indexers. Those indexers were either supplied by NEAR Protocol or custom made. Building a custom indexer meant transforming JSON files from the NEAR Lake storage layer and loading them into a target database engine such as PostgreSQL; only then could a user execute queries against the data. This approach is complex, time-consuming, and resource-intensive, and it requires constant monitoring to ensure the database holds the most up-to-date information.
NEAR BigQuery Public Dataset changes this. It provides near real-time blockchain data that can be easily queried with SQL.
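As a rough illustration of what querying with SQL can look like from Python, here is a minimal sketch using the `google-cloud-bigquery` client library. The table name `bigquery-public-data.crypto_near_mainnet_us.receipt_actions` and the `block_date` column are assumptions made for this example and are not stated in this post; please check the documentation for the actual dataset, tables, and columns. The helper names `build_daily_count_query` and `run_daily_count` are hypothetical.

```python
from datetime import date

# Assumed table name, used for illustration only; verify it in the documentation.
DEFAULT_TABLE = "bigquery-public-data.crypto_near_mainnet_us.receipt_actions"


def build_daily_count_query(table: str = DEFAULT_TABLE) -> str:
    """Build a SQL query that counts rows for one day.

    The `block_date` column is an assumption about the table schema.
    The day is passed as the named query parameter @day.
    """
    return (
        "SELECT block_date, COUNT(*) AS action_count\n"
        f"FROM `{table}`\n"
        "WHERE block_date = @day\n"
        "GROUP BY block_date"
    )


def run_daily_count(day: str, table: str = DEFAULT_TABLE):
    """Run the query for an ISO date string such as '2023-11-14'.

    Requires the google-cloud-bigquery package and Google Cloud credentials.
    """
    # Imported here so the query builder works without the client installed.
    from google.cloud import bigquery

    client = bigquery.Client()
    job_config = bigquery.QueryJobConfig(
        query_parameters=[
            bigquery.ScalarQueryParameter("day", "DATE", date.fromisoformat(day))
        ]
    )
    query_job = client.query(build_daily_count_query(table), job_config=job_config)
    return list(query_job.result())
```

Filtering on a single day keeps the amount of data scanned small, which matters because BigQuery on-demand pricing is based on bytes processed.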
We built the NEAR LakeHouse in Databricks. The data is loaded into raw bronze files using Databricks Autoloader, and transformed with Databricks Delta Live Tables into cleaned and enriched silver tables following the Databricks Medallion Architecture. The silver tables are then copied into the GCP BigQuery Public Dataset ready for consumption.
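To make the bronze-to-silver step more concrete, here is a minimal plain-Python sketch of the idea: parse raw records, drop the ones that cannot be used, and add a derived column. It is not the Databricks Delta Live Tables code from the repository. The field names `block_height`, `block_timestamp`, and `block_date`, the assumption that timestamps are in nanoseconds, and the function name `to_silver` are all hypothetical choices for this illustration.

```python
import json
from datetime import datetime, timezone


def to_silver(bronze_lines):
    """Turn raw JSON lines (bronze) into cleaned, enriched rows (silver).

    Field names are hypothetical; timestamps are assumed to be
    nanoseconds since the Unix epoch.
    """
    silver = []
    for line in bronze_lines:
        try:
            record = json.loads(line)
            height = int(record["block_height"])
            ts_ns = int(record["block_timestamp"])
        except (ValueError, KeyError, TypeError):
            # Cleaning step: skip malformed or incomplete records.
            continue
        # Enrichment step: derive a UTC calendar date from the timestamp.
        ts_seconds = ts_ns // 1_000_000_000
        block_date = datetime.fromtimestamp(ts_seconds, tz=timezone.utc).date()
        silver.append(
            {
                "block_height": height,
                "block_timestamp": ts_ns,
                "block_date": block_date.isoformat(),
            }
        )
    return silver
```

A derived date column like this is a common enrichment because it gives downstream queries a cheap way to restrict themselves to a time range.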
The code is open-source and can be found in our GitHub repository: near/near-public-lakehouse
To learn more about how to get started and the data available, please check our documentation:
https://docs.near.org/bos/queryapi/big-query
NEAR BigQuery Public Dataset is now available for anyone wanting to harness blockchain data for their own needs. BigQuery can help not only developers, but broader audiences including:
We are grateful to the following contributors who helped us deliver the NEAR BigQuery Public Dataset.