D-DS-FN-23試験無料問題集（358題）「EMC Dell Data Science Foundations 認定」

出題：1

Which activity is performed in the Operationalize phase of the Data Analytics Lifecycle?

A. Try different analytical techniques

B. Define the process to maintain the model

C. Try different variables

D. Transform existing variables

正解：B 解答を投票する

出題：2

Consider the following SQL statement:
SELECT employee_id, year, salary, avg(salary)
OVER
(PARTITION BY employee_id ORDER BY year ROWS BETWEEN 2 PRECEDING AND CURRENT ROW) as result_1 FROM employee ORDER BY employee_id, year For each employee_id, what is returned as result_1?

A. Average employee_id

B. Average salary across all employee_id values

C. Three year rolling average salary

D. Four year rolling average salary

正解：C 解答を投票する

出題：3

What would be considered "Big Data"?

A. Daily Log files from a web server that receives 100, 000 hits per minute

B. Aggregated statistical data stored in a relational database table

C. Spreadsheets containing monthly sales data for a Global 100 corporation

D. An OLAP Cube containing customer demographic information about 100, 000, 000 customers

正解：A 解答を投票する

出題：4

You have created a Linear Regression model to predict total sales based on variables M, N, P and Q as shown in the graphic. You originally expected all variables to have positive coefficients.

Which action would you take?

A. Accept only statistically significant variables and investigate correlated independent variables

B. Accept none of the variables and investigate correlations between all variables

C. Accept all variables and begin model validation steps against holdout data

D. Accept only positive variables and investigate potential correlation with the dependent variable

正解：B 解答を投票する

出題：5

Data visualization is used in the final presentation of an analytics project.
For what else is this technique commonly used?

A. Descriptive statistics

B. Model selection

C. Assessing data quality

D. ETLT

正解：C 解答を投票する

出題：6

You are performing a market basket analysis using the Apriori algorithm.
Which measure is a ratio describing the how many more times two items are present together than would be expected if those two items are statistically independent?

A. Confidence

B. Leverage

C. Lift

D. Support

正解：C 解答を投票する

出題：7

An IT department deployed a spam filter to reduce the amount of junk e-mail received by its employees.
After six months, they notice that the spam filter is less effective than when initially deployed.
They examine the system running the spam filter and it appears to be operating normally.
What action would improve the effectiveness of the spam filter?

A. Add more storage to the spam filtering system

B. Add more processing power to the spam filtering system

C. Create a linear regression model to calculate the probability of an email being spam

D. Retrain the spam filter with newer examples of spam emails

正解：D 解答を投票する

出題：8

Refer to the exhibit.

In association rules, for itemsets X and Y, which expression defines leverage?

A. b

B. a

C. c

D. d

正解：B 解答を投票する

出題：9

How should project results be communicated to executives and the project sponsor?

A. Demonstrate your technical prowess to establish credibility

B. Provide model performance visualizations

C. Emphasize coding details and technical requirements

D. Focus on business outcomes and benefits

正解：D 解答を投票する

解説: (GoShiken メンバーにのみ表示されます)

出題：10

Which analytic technique would be appropriate to estimate blood pressure based on age and weight?

A. K-means clustering

B. Time series analysis

C. Linear regression

D. Naïve Bayesian classification

正解：C 解答を投票する

出題：11

Which data asset is an example of semi-structured data?

A. Webserver log

B. News article

C. XML data file

D. Database table

正解：C 解答を投票する

出題：12

A data scientist is given an R data frame (i.e., empdata) with the following columns: Age Salary Occupation Education Gender The scientist wants to examine only the Salary and Occupation columns for ages greater than '40'.
Which command extracts the appropriate rows and columns from the data frame?

A. empdata[empdata$Age > 40, c("Salary","Occupation")]

B. empdata[c("Salary","Occupation"), empdata$Age > 40]

C. empdata[Age > 40, ("Salary","Occupation")]

D. empdata[, c("Salary","Occupation")]$Age > 40

正解：A 解答を投票する

D-DS-FN-23試験無料問題集「EMC Dell Data Science Foundations 認定」