[Nov 22, 2023] Prepare For The DP-100 Question Papers In Advance [Q232-Q253]

4/5 - (1 vote)

[Nov 22, 2023] Prepare For The DP-100 Question Papers In Advance

DP-100 PDF Dumps Real 2023 Recently Updated Questions

Microsoft DP-100 exam is a great way for data scientists to validate their skills and knowledge in Azure data science solutions. Passing DP-100 exam shows that the candidate has the necessary skills to design, implement, and deploy data science solutions on Azure. Moreover, this certification can be a valuable asset for individuals who want to advance their career in the data science field, as it demonstrates their proficiency in various areas related to data science.

Microsoft DP-100 Exam Syllabus Topics:

Topic	Details
Topic 1	Determine Relative Size Of Splits Resample A Dataset To Impose Balance Adjust Performance Metric To Resolve Imbalances
Topic 2	Determine Ideal Split Based On The Nature Of The Data Determine Number Of Splits Identify Data Imbalances
Topic 3	Select An Algorithmic Approach Consider Data Preparation Steps That Are Specific To The Selected Algorithms
Topic 4	Determine Appropriate Performance Metrics Implement Appropriate Algorithms
Topic 5	Analyze And Recommend Tools That Meet System Requirements Set Up Development Environment
Topic 6	Assess The Deployment Environment Constraints Select The Development Environment
Topic 7	Review Visual Analytics Data To Discover Patterns And Determine Next Steps Design A Data Sampling Strategy

Q232. You plan to explore demographic data for home ownership in various cities. The data is in a CSV file with the following format:
age,city,income,home_owner
21,Chicago,50000,0
35,Seattle,120000,1
23,Seattle,65000,0
45,Seattle,130000,1
18,Chicago,48000,0
You need to run an experiment in your Azure Machine Learning workspace to explore the data and log the results. The experiment must log the following information:
the number of observations in the dataset
a box plot of income by home_owner
a dictionary containing the city names and the average income for each city You need to use the appropriate logging methods of the experiment’s run object to log the required information.
How should you complete the code? To answer, drag the appropriate code segments to the correct locations.
Each code segment may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.

Q233. You need to modify the inputs for the global penalty event model to address the bias and variance issue.
Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.

Q234. You register a model that you plan to use in a batch inference pipeline.
The batch inference pipeline must use a ParallelRunStep step to process files in a file dataset. The script has the ParallelRunStep step runs must process six input files each time the inferencing function is called.
You need to configure the pipeline.
Which configuration setting should you specify in the ParallelRunConfig object for the PrallelRunStep step?

process_count_per_node= “6”

node_count= “6”

mini_batch_size= “6”

error_threshold= “6”

Q235. You create an experiment in Azure Machine Learning Studio. You add a training dataset that contains 10,000 rows. The first 9,000 rows represent class 0 (90 percent).
The remaining 1,000 rows represent class 1 (10 percent).
The training set is imbalances between two classes. You must increase the number of training examples for class 1 to 4,000 by using 5 data rows. You add the Synthetic Minority Oversampling Technique (SMOTE) module to the experiment.
You need to configure the module.
Which values should you use? To answer, select the appropriate options in the dialog box in the answer area.
NOTE: Each correct selection is worth one point.

Q236. An organization uses Azure Machine Learning service and wants to expand their use of machine learning.
You have the following compute environments. The organization does not want to create another compute environment.

You need to determine which compute environment to use for the following scenarios.
Which compute types should you use? To answer, drag the appropriate compute environments to the correct scenarios. Each compute environment may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.

Q237. You are building an experiment using the Azure Machine Learning designer.
You split a dataset into training and testing sets. You select the Two-Class Boosted Decision Tree as the algorithm.
You need to determine the Area Under the Curve (AUC) of the model.
Which three modules should you use in sequence? To answer, move the appropriate modules from the list of modules to the answer area and arrange them in the correct order.

Q238. You need to define a process for penalty event detection.
Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.

Q239. You create an experiment in Azure Machine Learning Studio- You add a training dataset that contains 10.000 rows. The first 9.000 rows represent class 0 (90 percent). The first 1.000 rows represent class 1 (10 percent).
The training set is unbalanced between two Classes. You must increase the number of training examples for class 1 to 4,000 by using data rows. You add the Synthetic Minority Oversampling Technique (SMOTE) module to the experiment.
You need to configure the module.
Which values should you use? To answer, select the appropriate options in the dialog box in the answer area.
NOTE: Each correct selection is worth one point.

Q240. You train a classification model by using a decision tree algorithm.
You create an estimator by running the following Python code. The variable feature_names is a list of all feature names, and class_names is a list of all class names.
from interpret.ext.blackbox import TabularExplainer

You need to explain the predictions made by the model for all classes by determining the importance of all features.
For each of the following statements, select Yes if the statement is true. Otherwise, select No.
NOTE: Each correct selection is worth one point.

Q241. You are creating a new Azure Machine Learning pipeline using the designer.
The pipeline must train a model using data in a comma-separated values (CSV) file that is published on a website. You have not created a dataset for this file.
You need to ingest the data from the CSV file into the designer pipeline using the minimal administrative effort.
Which module should you add to the pipeline in Designer?

Convert to CSV

Enter Data Manually
D

Import Data

Dataset

Q242. You train a machine learning model.
You must deploy the model as a real-time inference service for testing. The service requires low CPU utilization and less than 48 MB of RAM. The compute target for the deployed service must initialize automatically while minimizing cost and administrative overhead.
Which compute target should you use?

Azure Kubernetes Service (AKS) inference cluster

Azure Machine Learning compute cluster

Azure Container Instance (ACI)

attached Azure Databricks cluster

Q243. You have a dataset that contains over 150 features. You use the dataset to train a Support Vector Machine (SVM) binary classifier.
You need to use the Permutation Feature Importance module in Azure Machine Learning Studio to compute a set of feature importance scores for the dataset.
In which order should you perform the actions? To answer, move all actions from the list of actions to the answer area and arrange them in the correct order.

Q244. You plan to use the Hyperdrive feature of Azure Machine Learning to determine the optimal hyperparameter values when training a model.
You must use Hyperdrive to try combinations of the following hyperparameter values:
* learning_rate: any value between 0.001 and 0.1
* batch_size: 16, 32, or 64
You need to configure the search space for the Hyperdrive experiment.
Which two parameter expressions should you use? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.

a choice expression for learning_rate

a uniform expression for learning_rate

a normal expression for batch_size

a choice expression for batch_size

a uniform expression for batch_size

Q245. You create an Azure Machine Learning workspace and install the MLflow library.
You need to tog different types of data by using the MLflow library.
Which method should you use? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.

Q246. You create a multi-class image classification deep learning model that uses the PyTorch deep learning framework.
You must configure Azure Machine Learning Hyperdrive to optimize the hyperparameters for the classification model.
You need to define a primary metric to determine the hyperparameter values that result in the model with the best accuracy score.
Which three actions must you perform? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.

Set the primary_metric_goal of the estimator used to run the bird_classifier_train.py script to maximize.

Add code to the bird_classifier_train.py script to calculate the validation loss of the model and log it as a float value with the key loss

Set the primary_metric_goal of the estimator used to run the bird_classifier_train.py script to minimize.

Set the primary_metric_name of the estimator used to run the bird_classifier_train.py script to accuracy.

Set the primary_metric_name of the estimator used to run the bird_classifier_train.py script to loss.

Add code to the bird_classifier_train.py script to calculate the validation accuracy of the model and log it as a float value with the key

Q247. You have a binary classifier that predicts positive cases of diabetes within two separate age groups.
The classifier exhibits a high degree of disparity between the age groups.
You need to modify the output of the classifier to maximize its degree of fairness across the age groups and meet the following requirements:
* Eliminate the need to retrain the model on which the classifier is based.
* Minimize the disparity between true positive rates and false positive rates across age groups.
Which algorithm and panty constraint should you use? To answer, select the appropriate options in the answer are a. NOTE: Each correct selection is worth one point.

Q248. You are developing a data science workspace that uses an Azure Machine Learning service.
You need to select a compote target to deploy the workspace.
What should you use?

Azure Data Lake Analytics

Azure Databrick .

Apache Spark for HDInsight.

Azure Container Service

Q249. You are evaluating a Python NumPy array that contains six data points defined as follows:
data = [10, 20, 30, 40, 50, 60]
You must generate the following output by using the k-fold algorithm implantation in the Python Scikit-learn machine learning library:
train: [10 40 50 60], test: [20 30]
train: [20 30 40 60], test: [10 50]
train: [10 20 30 50], test: [40 60]
You need to implement a cross-validation to generate the output.
How should you complete the code segment? To answer, select the appropriate code segment in the dialog box in the answer area.
NOTE: Each correct selection is worth one point.

Q250. You plan to use the Hyperdrive feature of Azure Machine Learning to determine the optimal hyperparameter values when training a model.
You must use Hyperdrive to try combinations of the following hyperparameter values. You must not apply an early termination policy.
learning_rate: any value between 0.001 and 0.1
* batch_size: 16, 32, or 64
You need to configure the sampling method for the Hyperdrive experiment Which two sampling methods can you use? Each correct answer is a complete solution.
NOTE: Each correct selection is worth one point.

Grid sampling

No sampling

Bayesian sampling

Random sampling

Q251. You need to correct the model fit issue.
Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.

Q252. You use Azure Machine Learning Studio to build a machine learning experiment.
You need to divide data into two distinct datasets.
Which module should you use?

Split Data

Load Trained Model

Assign Data to Clusters

Group Data into Bins

Q253. You create a binary classification model using Azure Machine Learning Studio.
You must use a Receiver Operating Characteristic (RO C) curve and an F1 score to evaluate the model.
You need to create the required business metrics.
How should you complete the experiment? To answer, select the appropriate options in the dialog box in the answer area.
NOTE: Each correct selection is worth one point.

The Microsoft DP-100 exam covers a range of topics related to data science, including data ingestion, transformation, and storage. Candidates will be tested on their ability to design and implement solutions using Azure tools and services, such as Azure Machine Learning, Azure Cognitive Services, and Azure Data Factory. They will also be tested on their ability to work with big data technologies such as Hadoop and Spark.

DP-100 Dumps and Practice Test (403 Exam Questions): https://www.dumpleader.com/DP-100_exam.html

Microsoft DP-100 Exam Syllabus Topics:

Leave a Reply Cancel reply