Skip to content

Commit

Permalink
add truncate to blogs (#6290)
Browse files Browse the repository at this point in the history
  • Loading branch information
mirnawong1 authored Oct 15, 2024
2 parents 5216428 + b578aae commit 3c88cc8
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 0 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -16,6 +16,8 @@ If you haven’t paid attention to the data industry news cycle, you might have

But I have to be honest: **I don’t care**. But not for the reasons you think.

<!-- truncate -->

## What is Iceberg?

To have this conversation, we need to start with the same foundational understanding of Iceberg. Apache Iceberg is a high-performance open table format developed for modern data lakes. It was designed for large-scale datasets, and within the project, there are many ways to interact with it. When people talk about Iceberg, it often means multiple components including but not limited to:
Expand Down
2 changes: 2 additions & 0 deletions website/blog/2024-10-05-snowflake-feature-store.md
Original file line number Diff line number Diff line change
Expand Up @@ -13,6 +13,8 @@ Flying home into Detroit this past week working on this blog post on a plane and

Think of the manufacturing materials needed as our data and the building of the bridge as the building of our ML models. There are thousands of engineers and construction workers taking materials from all over the world, pulling only the specific pieces needed for each part of the project. However, to make this project truly work at this scale, we need the warehousing and logistics to ensure that each load of concrete rebar and steel meets the standards for quality and safety needed and is available to the right people at the right time &mdash; as even a single fault can have catastrophic consequences or cause serious delays in project success. This warehouse and the associated logistics play the role of the feature store, ensuring that data is delivered consistently where and when it is needed to train and run ML models.

<!-- truncate -->

## What is a feature?

A feature is a transformed or enriched data that serves as an input into a machine learning model to make predictions. In machine learning, a data scientist derives features from various data sources to build a model that makes predictions based on historical data. To capture the value from this model, the enterprise must operationalize the data pipeline, ensuring that the features being used in production at inference time match those being used in training and development.
Expand Down

0 comments on commit 3c88cc8

Please sign in to comment.