Cognitive Class: DataOps Methodology Exam Answers 2021

Cognitive Class: DataOps Methodology Exam Answers 2021

A policy is a set of principles intended to direct decisions and produce objective outcomes. A policy is a declaration of intent that is carried out by a process or protocol. A governance body within an entity usually adopts policies. Both emotional and rational decision-making may be aided by policies. Subjective decision-making policies typically aid senior management in making decisions that must be based on the relative merits of a variety of considerations, and as a result, they are also difficult to objectively assess. Policies to aid objective decision making, on the other hand, are normally operational in nature and can be objectively checked, such as a login policy.

1. How does data classification affect defining policies?

  1. Inheritance, retention and probabilities
  2. Protection, reporting and inheritance
  3. Protection, accessibility and retention
  4. Retention, deletion and storage

2. What impact does a highly sensitive classification have on a policy definition?

  1. Require data anonymization, de-identification, and masking
  2. Limit access to the data and/or require data masking
  3. Limit access to the data and make it unprintable
  4. No impact

3. What are the most common state, country or regional regulations affecting personal information?

  1. SIN, SSN and BAN
  2. FDIC, BCBS and SOX
  3. CCPA, GDPR and LGPD
  4. PCI, PII and PHI

4. Once policies have been defined affecting the data, rules must be enforced to act.

  1. True
  2. False

5. Self Service of data is only possible when any data movement and transformation required to join multiple data assets have been performed.

  1. True
  2. False

6. Self Service can use the following governance artefacts to refine a search in a catalog. (Choose all that apply)

  1. Data Protection Rules
  2. Business Terms
  3. Tags

7. A data consumer should not be able to access data that has been identified as sensitive, where there is not a business need to do so.

  1. True
  2. False

8. Which of the following statements about Self Service are ?

  1. Data consumers typically do not know how to manipulate the data
  2. Data Protection rules prevent a data consumer from inadvertently seeing data that is sensitive
  3. Creating multiple catalogs can partition data assets by their content and anticipated audience
  4. A data consumer needs to know SQL to join multiple data assets

9. Data Consumers provide valuable input to data scientists by clarifying the combination of data assets and how they need to be transformed, prior to data movement being designed and implemented.

  1. True
  2. False

10. You should define the use case at the outset of a Data Movement and Integration project to support a “Build It and They Will Come” strategy.

  1. True
  2. False.

11. Which of the following does not represent a data integration pattern:

  1. Data virtualization
  2. Data replication
  3. Data lineage
  4. Message-oriented movement
  5. Bulk/batch

12. Which of the following is not a Data Movement and Integration Job Design consideration?

  1. Design for reusability
  2. Deployment models (e.g. Containers, Kubernetes Orchestration, OpenShift)
  3. Design for parallel processing
  4. Everything should be programmed in Python
  5. Design for job portability (build once and run anywhere)

13. Hand coding generally provides a 10X productivity gain over commercial data integration software tooling.

  1. True
  2. False

14. Which of the following is not an example of a message queuing system?

  1. Kafka
  2. VSAM
  3. Microsoft Azure Queues
  4. GCP PubSub
  5. AWS Simple Queue Service
  6. MQ

15. DataOps is a completely new methodology and it doesn’t learn anything from agile and devOps.

  1. True
  2. False

16. Data consumers can first start to provide feedback to the current data sprint in the stakeholder review meeting.

  1. True
  2. False

17. Which of the following assets or artifacts could be found in catalog?

  1. Code
  2. Business terms
  3. Data rules
  4. Source data
  5. Data lineage

18. All issues need to be remediated before moving on to the next data sprint.

  1. True
  2. False

19. Completing a data sprint involves publishing governed artifacts and data assets to a production environment.

  1. True
  2. False

20. DataOps is a fixed process which should not be changed once defined.

  1. True
  2. False

21. Improvements to the DataOps process could involve changes to

  1. Technology used in DataOps
  2. DataOps team roles and responsibilities
  3. Processes for ETL
  4. All of the above  

22. Reviewing the Data classification phase involves reviewing how accurate the data mappings to the business terms are.

  1. True
  2. False

23. Reviewing the Establish Baseline Process should include reviewing how effective the processes are for establishing a baseline for –

  1. External Regulatory requirements
  2. Organization maturity and Readiness
  3. Governance and Oversight
  4. All of the above

24. KPIs are key in determining the effectiveness of all parts of the DataOps process.

  1. True
  2. False

25. What is a data strategy?

  1. An architecture and actionable roadmap along with an action plan
  2. A competitive publication to show that our organization is modern
  3. A plan to move all legacy data systems to the cloud

26. Which of the following statements about Data Strategy are ?

  1. Whatever the type of data, it should only include internally produced data
  2. All types of data – both structured and unstructured need to be considered
  3. Volumes of data have increased hugely, but are now starting to stabilize
  4. Only business executives should be consulted in putting together a strategy

27. Which of the following roles are active team members of any DataOps team?

  1. Chief Technology Officer
  2. Chief Data Officer
  3. Data Engineer
  4. Database Administrator
  5. Data Steward
  6. Data Architect
  7. Data Scientist

28. Creating and maintaining business terms is a major responsibility of which following role?

  1. Data Engineer
  2. Data Quality Analyst
  3. Data Steward
  4. Data Scientist

29. Business Priority should be the primary focus when deciding what the DataOps team should do.

  1. True
  2. False

30. What is a data backlog?

  1. A bottleneck in the data pipeline
  2. A list of all data sources
  3. A prioritized set of requirements expressed as data tasks
  4. A plan to move all data into a catalog

31. A Data Task should be prioritized by considering:

  1. The cost of providing the data
  2. The career advancement possibilities of solving business challenges
  3. The impact to sales from implementing the data pipeline
  4. All of the above

32. KPIs are used to determine the progress and throughput of a DataOps data sprint.

  1. True
  2. False

33. What are key components of DataOps toolchain?

  1. Continuous Deployment
  2. Communication
  3. Source Control
  4. All of above

34. Who is responsible for creating DataOps toolchain? (Choose all that apply)

  1. Data Scientist
  2. Administrator
  3. DBA
  4. Data Engineer

35. What is the primary objective of the Discover phase?

  1. Decide what the analytics team wants to have for lunch.
  2. Identify and locate the specific data elements required to accomplish an analysis
  3. Uncover the meaning of data column headers and how they relate to the underlying data.
  4. Gain an understanding of the business goals and KPIs of an analysis effort.

36. Which description best defines taxonomy?

  1. Organizing data elements into meaningful structures.
  2. An IBM network protocol which reduces network latency.
  3. The art of preparing, stuffing, and mounting the skins of animals with lifelike effect.

37. Which of the following is the objective of classification?

  1. To bring out points of similarity and dissimilarity among various groups.
  2. To present data in a simple, logical and understandable form.
  3. To condense the mass of data.
  4. All of the above

38. A data quality framework consists of which of the following 4 phases:

  1. Profile
  2. Define
  3. Remediate
  4. Monitor
  5. Assess
  6. Deploy

39. How does data classification affect defining policies?

  1. Inheritance, retention and probabilities
  2. Protection, reporting and inheritance
  3. Protection, accessibility and retention
  4. Retention, deletion and storage

40. What impact does a highly sensitive classification have on a policy definition?

  1. Require data anonymization, de-identification, and masking
  2. Limit access to the data and/or require data masking
  3. Limit access to the data and make it unprintable
  4. No impact

41. Self Service can use the following governance artefacts to refine a search in a catalog. (Choose all that apply)

  1. Data Protection Rules
  2. Business Terms
  3. Tags

42. Which of the following statements about Self Service are ?

  • A data consumer needs to know SQL to join multiple data assets
  • Data Protection rules prevent a data consumer from inadvertently seeing data that is sensitive
  • Creating multiple catalogs can partition data assets by their content and anticipated audience
  • Data consumers typically do not know how to manipulate the data

43. Which of the following does not represent a data integration pattern:

  1. Data virtualization
  2. Data replication
  3. Data lineage
  4. Message-oriented movement
  5. Bulk/batch

44. Which of the following is not a Data Movement and Integration Job Design consideration?

  1. Design for reusability
  2. Deployment models (e.g. containers, Kubernetes orchestration, OpenShift)
  3. Design for parallel processing
  4. Everything should be programmed in Python
  5. Design for job portability (build once and run anywhere)

45. Data consumers can first start to provide feedback to the current data sprint in the stakeholder review meeting.

  1. True
  2. False

46. Which of the following could be found in catalog?

  1. Code
  2. Business terms
  3. Data rules
  4. Source data
  5. Data lineage

47. All issues need to be remediated before moving on to the next data sprint.

  1. True
  2. False

48. Improvements to the DataOps process could involve changes to

  1. Technology used in DataOps
  2. DataOps team roles and responsibilities
  3. Processes for ETL
  4. All of the above

49. Reviewing the Establish Baseline Process should include reviewing how effective are the processes for establishing a baseline for –

  1. External Regulatory requirements
  2. Organization maturity and Readiness
  3. Governance and Oversight
  4. All of the above

50. Before we can put together a data strategy, we need to have a good understanding of the data available and how it is used in the organization.

  1. True
  2. False

51. What is a data strategy?

  1. An architecture and actionable roadmap along with an action plan
  2. A competitive publication to show that our organization is modern
  3. A plan to move all legacy data systems to the cloud

52. Implementing a data strategy should always result in cost savings in the year the plan is realized.

  1. True
  2. False  

53. Which of the following statements about Data Strategy are ?

  1. Whatever the type of data, it should only include internally produced data
  2. All types of data – both structured and unstructured need to be considered
  3. Volumes of data have increased hugely, but are now starting to stabilize
  4. Only business executives should be consulted in putting together a strategy

54. Data Governance is a key part of executing a data strategy.

  1. True
  2. False

55. A DataOps team consists of members mostly from IT departments.

  1. True
  2. False

56. Which of the following roles are active team members of any DataOps team?

  1. Chief Technology Officer
  2. Chief Data Officer
  3. Data Engineer
  4. Database Administrator
  5. Data Steward
  6. Data Architect
  7. Data Scientist

57. Creating and maintain business terms is a major responsibility of which following role?

  1. Data Engineer
  2. Data Quality Analyst
  3. Data Steward
  4. Data Scientist

58. Only Chief Data Officer can update the KPIs for a data sprint.

  1. True
  2. False

59. DataOps relies heavily on the use of automation, so that communication among team members is not necessary.

  1. True
  2. False

60. DataOps toolchain helps you deliver quality data slowly.

  1. True
  2. False  

61. DataOps Toolchain and DevOps are the same thing.

  1. True
  2. False

62. DataOps Toolchain can work without DataOps API(s).

  1. True
  2. False

63. What are the key components of DataOps Toolchain?

  1. Continuous Deployment
  2. Communication
  3. Source Control
  4. All of above

64. Who is responsible for creating DataOps Toolchain? (Choose all that apply)

  1. Data Scientist
  2. Administrator
  3. DBA
  4. Data Engineer

65. Data Management is the same as Information Governance.

  1. True
  2. False

66. What is the most costly result from an external influence to an organization?

  1. Data Breach Fines and Penalties
  2. Insurance Policy Payout
  3. Claim Settlement
  4. None of these

67. Reference data is defined as data used as a permissible value within a data field.

  1. True
  2. False

68. Business Priority should be the primary focus when deciding what the DataOps team should do.

  1. True
  2. False

69. What is a data backlog?

  1. A bottleneck in the data pipeline
  2. A list of all data sources
  3. A prioritized set of requirements expressed as data tasks
  4. A plan to move all data into a catalog

70. A prioritized data backlog will reduce the time taken to start the next DataOps iteration.

  1. True
  2. False

71. A Data Task should be prioritized by considering:

  1. The cost of providing the data
  2. The career advancement possibilities of solving business challenges
  3. The impact to sales from implementing the data pipeline
  4. All of the above

72. KPIs are used to determine the progress and throughput of a DataOps data sprint.

  1. True
  2. False

73. You will need someone on your team with detailed knowledge of the business processes you’re going to analyze so selected data elements are appropriate to reaching your objectives.

  1. True
  2. False

74. What should you do if you identify gaps or mismatches in the data required for the analysis?

  1. Rethink how you will do the analysis with different data
  2. Create the missing data
  3. Find a new source for the missing or mismatched data
  4. All of the above

75. You should trace the linage of data elements to be used for analysis to make sure they come from a trusted source.

  1. True
  2. False

76. What is the primary objective of the Discover phase?

  1. Decide what the analytics team wants to have for lunch
  2. Identify and locate the specific data elements required to accomplish an analysis
  3. Uncover the meaning of data column headers and how they relate to the underlying data
  4. Gain an understanding of the business goals and KPIs of an analysis effort

77. A Data Engineer who thoroughly understands where specific data resides, including the specific databases and files where each identified data element resides, should be involved in Data Discovery process.

  1. True
  2. False

78. Classification of each data element will make it easier going forward for users to distinguish the meaning and applicability of the data for their purposes.

  1. True
  2. False

79. Which description best defines taxonomy?

  1. Organizing data elements into meaningful structures
  2. An IBM network protocol which reduces network latency
  3. The art of preparing, stuffing, and mounting the skins of animals with lifelike effect

80. A single data element can be placed into an unlimited number of data domains.

  1. True
  2. False

81. Which of the following is the objective of classification?

  1. To bring out points of similarity and dissimilarity among various groups
  2. To present data in a simple, logical and understandable form
  3. To condense the mass of data
  4. All of the above

82. You should design workflows which are specific to the classification tool you are using.

  1. True
  2. False

83. Data quality is data accuracy.

  1. True
  2. False

84. All data across the enterprise should have the same data quality.

  1. True
  2. False

85. A data quality framework consists of which of the following 4 phases:

  1. Profile
  2. Define
  3. Remediate
  4. Monitor
  5. Assess
  6. Deploy

86. When assessing data quality, you only need the data set containing the data, metadata is optional.

  1. True
  2. False

A for-profit corporation or organisation is known as an enterprise, although it is most commonly associated with entrepreneurial enterprises. People who are successful entrepreneurs are sometimes referred to as “enterprising.” Acting Captain Spock (Zachary Quinto) – After Pike was kidnapped by Nero, First Officer Spock assumed command of the Enterprise as Acting Captain, which was opposed by cadet James T. Kirk, who later assumed command of the Enterprise as Acting Captain, leading to Nero’s defeat. Captain James T. (James T.) Simply put, enterprise is a person’s or an organization’s ability to: Take risks. It takes a lot of courage to start a new company.

Leave a Reply

Your email address will not be published. Required fields are marked *