DEA-C02試験無料問題集「Snowflake SnowPro Advanced: Data Engineer (DEA-C02) 認定」
You are tasked with implementing row-level filtering on a 'customers' table in Snowflake. You need to restrict access to customer data based on the user's region. The 'customers' table has a 'region' column, and you have a mapping table 'user regions' that associates users with the regions they are allowed to access. The 'user_regions' table has columns 'username' and 'region'. Which of the following SQL statements correctly creates and applies a row access policy to achieve this, minimizing complexity and maximizing performance? Select all that apply:


正解:A,C
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
A Snowflake table 'CUSTOMER ORDERS is clustered by 'ORDER DATE. You have observed the clustering depth increasing over time, impacting query performance. To improve performance, you decide to recluster the table. However, you need to minimize the impact on concurrent DML operations and cost. Which of the following strategies would be MOST effective in managing this reclustering process?
正解:D
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
A data engineer is investigating high credit consumption on a Snowflake warehouse due to frequent re-clustering operations on a large table named 'WEB EVENTS. This table is clustered on 'EVENT TIMESTAMP' and 'USER ID. The engineer suspects that the high frequency of data ingestion, especially out-of-order 'EVENT TIMESTAMP' values, contributes to the poor clustering. Choose the options that can lead to optimizing clustering and reducing credit consumption, assuming you have limited control over the ingestion process and data quality.
正解:A,C
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You are designing a Snowpipe pipeline to ingest data from an AWS SQS queue. The queue contains notifications about new files arriving in an S3 bucket. However, due to network issues, some notifications are delayed, causing Snowpipe to potentially miss files. Which of the following strategies, when combined, will BEST address the problem of delayed notifications and ensure data completeness?
正解:C
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You are tasked with optimizing a Snowpipe Streaming pipeline that ingests data from Kafka into a Snowflake table named 'ORDERS' You notice that while the Kafka topic has high throughput, the data ingestion into Snowflake is lagging. The pipe definition is as follows: "sql CREATE OR REPLACE PIPE ORDERS_PIPEAS COPY INTO ORDERS FROM @KAFKA STAGE FILE_FORMAT = (TYPE = JSON); Which of the following actions, taken individually, would be MOST effective in improving the ingestion rate, assuming sufficient compute resources are available in your Snowflake virtual warehouse?
正解:D
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You have implemented a masking policy on the 'SSN' column of the 'EMPLOYEES' table. You now need to suspend the masking policy temporarily for a specific batch job that requires access to the unmasked data'. What is the recommended way to achieve this without dropping the masking policy or altering the user's role?
正解:D
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You are managing a Snowflake environment where data retention is set to the default 1 day for all databases and tables. You need to clone a production table, 'CUSTOMER DATA, to a development environment to test some complex transformations. However, after cloning, you realize that the original 'CUSTOMER DATA' table in production was accidentally dropped 2 days ago. Which of the following statements accurately describe the situation and what can be done, if anything?
正解:D
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You have an external table in Snowflake pointing to data in Azure Blob Storage. The data consists of customer transactions, and new files are added to the Blob Storage daily You want to ensure that Snowflake automatically picks up these new files and reflects them in the external table without manual intervention. However, you are observing delays in Snowflake detecting the new files. What are the potential reasons for this delay and how can you troubleshoot them? (Choose two)
正解:B,E
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You are developing a data pipeline in Snowflake that uses SQL UDFs for data transformation. You need to define a UDF that calculates the Haversine distance between two geographical points (latitude and longitude). Performance is critical. Which of the following approaches would result in the most efficient UDF implementation, considering Snowflake's execution model?
正解:E
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You need to implement both a row access policy and a dynamic data masking policy on the 'EMPLOYEE table in Snowflake. The requirements are as follows: 1. Employees should only be able to see their own record in the 'EMPLOYEE table. 2. The 'SALARY' column should be masked for all employees except those with the 'HR ADMIN' role. Unmasked values are required for compliance reasons, they need to be available for 'HR ADMIN' role. Given the following table structure: CREATE TABLE EMPLOYEE ( EMPLOYEE ID INT, EMPLOYEE NAME STRING, SALARY NUMBER, EMAIL STRING ) ; Which of the following sets of steps correctly implement the row access policy and dynamic data masking policy?


正解:E
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
Given the following scenario: You have an external table 'EXT SALES in Snowflake pointing to a data lake in Azure Blob Storage. The storage account network rules are configured to only allow specific IP addresses and virtual network subnets, enhancing security. You are getting intermittent errors when querying 'EXT SALES. Which of the following could be the cause(s) and the corresponding solution(s)? Select all that apply.
正解:B,E
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You're tasked with building a data pipeline using Snowpark Python to incrementally load data into a target table 'SALES SUMMARY from a source table 'RAW SALES. The pipeline needs to ensure that only new or updated records from 'RAW SALES are merged into 'SALES SUMMARY' based on a 'TRANSACTION ID'. You want to use Snowpark's 'MERGE' operation for this, but you also need to handle potential conflicts and log any rejected records to an error table 'SALES SUMMARY ERRORS'. Which of the following approaches offers the MOST robust and efficient solution for handling errors and ensuring data integrity within the MERGE statement?
正解:B
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You are planning to monetize a dataset on the Snowflake Marketplace. You want to provide potential customers with sample data to evaluate before they purchase a full subscription. Which of the following strategies are valid and recommended for offering a free sample of your data within the Snowflake Marketplace? (Select all that apply)
正解:B,D
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You are responsible for monitoring data quality in a Snowflake data warehouse. Your team has identified a critical table, 'CUSTOMER DATA, where the 'EMAIL' column is frequently missing or contains invalid entries. You need to implement a solution that automatically detects and flags these anomalies. Which of the following approaches, or combination of approaches, would be MOST effective in proactively monitoring the data quality of the 'EMAIL' column?
正解:A,C,E
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You are tasked with sharing a subset of a customer table (CUSTOMER DATA') residing in your organization's Snowflake account with a partner organization. You need to mask personally identifiable information (PII) while providing near real-time updates. You decide to use a secure view. Which of the following SQL statements is the MOST efficient and secure way to accomplish this, assuming the partner only needs 'customer id', 'masked_email', 'city', and 'state'? The email should be masked using SHA256.


正解:D
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)