Unit 6 - Subjective Questions
INT428 • Practice Questions with Detailed Answers
Explain the capabilities and workflow of ChatGPT Advanced Data Analysis (formerly Code Interpreter).
ChatGPT Advanced Data Analysis is a tool that allows the AI to write and execute Python code in a sandboxed environment. It significantly enhances data analysis capabilities.
Key Capabilities:
- File Uploads: Users can upload diverse file formats (CSV, Excel, PDF, JSON, images).
- Data Cleaning: It can identify missing values, normalize data formats, and handle outliers automatically.
- Visualization: It generates charts and graphs (histograms, scatter plots, heatmaps) using libraries like Matplotlib and Seaborn.
- Mathematical Solving: It performs complex mathematical calculations and solves equations accurately by executing code rather than predicting text.
Workflow:
- Upload: User uploads a dataset.
- Prompt: User asks natural language questions (e.g., "Show me the trend of sales over time").
- Code Generation: The model writes Python code to perform the task.
- Execution: The code runs in the sandbox.
- Output: The model presents the result (graph, table, or answer) and explains the methodology.
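To make the Code Generation and Execution steps concrete, here is a minimal sketch of the kind of Python the tool might write for the prompt above. The file name and column names (`sales.csv`, `date`, `sales`) are assumptions for illustration, not part of any real upload.

```python
import pandas as pd
import matplotlib.pyplot as plt

# Hypothetical uploaded file with 'date' and 'sales' columns
df = pd.read_csv("sales.csv", parse_dates=["date"])

# Aggregate to monthly totals to smooth the trend
monthly = df.set_index("date")["sales"].resample("M").sum()

# Plot the trend over time
plt.figure(figsize=(8, 4))
plt.plot(monthly.index, monthly.values, marker="o")
plt.title("Sales Trend Over Time")
plt.xlabel("Month")
plt.ylabel("Total Sales")
plt.tight_layout()
plt.show()
```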
Differentiate between Structured and Unstructured data with suitable examples.
Data is generally categorized based on its organization and format.
| Feature | Structured Data | Unstructured Data |
|---|---|---|
| Definition | Data that adheres to a pre-defined data model and is highly organized. | Data that does not have a pre-defined model or specific format. |
| Storage | Relational Databases (RDBMS), SQL tables, Data Warehouses. | Data Lakes, NoSQL databases. |
| Format | Rows and columns, fixed fields. | Audio, video, images, text documents, emails. |
| Searchability | Easy to search using SQL queries. | Difficult to search; requires processing (OCR, NLP). |
| Volume | Typically accounts for ~20% of enterprise data. | Accounts for ~80% of enterprise data (Big Data). |
| Examples | Excel spreadsheets, Bank transaction logs, Inventory lists. | Social media posts, Surveillance video, Customer support audio logs. |
Describe the role of Tableau in AI-driven data visualization and how it integrates with AI features.
Tableau is a leading business intelligence and data visualization tool used to convert raw data into understandable visual insights.
Role in Visualization:
- Interactive Dashboards: Creates dynamic dashboards that allow users to drill down into data points.
- Data Blending: Combines data from various sources (Cloud, SQL, Spreadsheets) into a single view.
- Pattern Recognition: Helps in identifying trends, outliers, and correlations visually.
AI Integration (Tableau AI/Einstein Discovery):
- Explain Data: An AI feature that runs statistical models to explain the value of a specific data point, identifying potential drivers behind a trend.
- Ask Data: Uses Natural Language Processing (NLP) to allow users to type questions (e.g., "What were the sales in Q3?") and receive visual answers.
- Predictive Modeling: It can integrate with Python or R models to visualize predictions alongside historical data.
What is a Data Pipeline? Explain its core components.
A Data Pipeline is a set of automated processes that move data from one system to another, often transforming it along the way to make it suitable for analysis or machine learning.
Core Components:
- Source (Ingestion):
- The origin of the data. This could be IoT sensors, transactional databases, CRMs, or external APIs.
- Processing (Transformation):
- The raw data is cleaned, validated, and transformed. This is often referred to as ETL (Extract, Transform, Load) or ELT.
- Operations include filtering, aggregation, and format conversion.
- Destination (Storage):
- Where the data resides after processing.
- Examples: Data Warehouses (Snowflake, Redshift) or Data Lakes (AWS S3).
- Orchestration (Workflow Management):
- Tools (like Apache Airflow) that schedule and monitor the pipeline to ensure tasks run in the correct order and handle failures.
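As an illustration of the orchestration component, the sketch below defines a minimal Apache Airflow DAG that wires the three pipeline stages in order. The DAG id, schedule, and task bodies are placeholder assumptions, not a production pipeline.

```python
from datetime import datetime
from airflow import DAG
from airflow.operators.python import PythonOperator

# Placeholder task logic -- real tasks would call ingestion/ETL code
def ingest():
    print("Pulling raw data from the source system")

def transform():
    print("Cleaning, validating, and aggregating the data")

def load():
    print("Writing the processed data to the warehouse")

with DAG(
    dag_id="example_daily_pipeline",    # hypothetical name
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",         # run once per day
    catchup=False,
) as dag:
    t_ingest = PythonOperator(task_id="ingest", python_callable=ingest)
    t_transform = PythonOperator(task_id="transform", python_callable=transform)
    t_load = PythonOperator(task_id="load", python_callable=load)

    # Enforce the Source -> Processing -> Destination order
    t_ingest >> t_transform >> t_load
```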
Discuss the advantages and disadvantages of Cloud Deployment versus Edge Deployment for AI models.
AI models can be deployed on the Cloud (centralized servers) or the Edge (local devices).
Cloud Deployment
- Advantages:
- Scalability: Virtually unlimited compute capacity to handle large workloads.
- Ease of Management: Centralized updates and monitoring.
- Storage: Capacity to store massive historical datasets.
- Disadvantages:
- Latency: Data must travel to the server and back, causing delays.
- Connectivity: Requires a stable internet connection.
Edge Deployment
- Advantages:
- Low Latency: Real-time processing (essential for autonomous cars).
- Privacy: Data stays on the device (e.g., health data on a watch).
- Bandwidth Efficiency: Reduces the need to upload terabytes of raw data.
- Disadvantages:
- Resource Constraints: Limited battery, memory, and processing power on devices.
- Maintenance: Difficult to update models across millions of fragmented devices.
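To illustrate the Edge side, the sketch below runs a hypothetical quantized image classifier locally with the TensorFlow Lite interpreter, so no frame ever leaves the device. The model file name and the random placeholder frame are assumptions for the example.

```python
import numpy as np
import tensorflow as tf

# Hypothetical on-device model file produced earlier by a training pipeline
interpreter = tf.lite.Interpreter(model_path="mobilenet_quant.tflite")
interpreter.allocate_tensors()

input_details = interpreter.get_input_details()
output_details = interpreter.get_output_details()

# Placeholder frame; in practice this would come from the device camera
shape = input_details[0]["shape"]
frame = np.random.randint(0, 256, size=shape).astype(input_details[0]["dtype"])

# Run inference entirely on the device -- no network round trip
interpreter.set_tensor(input_details[0]["index"], frame)
interpreter.invoke()
scores = interpreter.get_tensor(output_details[0]["index"])
print("Predicted class:", int(np.argmax(scores)))
```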
Define MLOps and explain why it is essential for the AI lifecycle.
MLOps (Machine Learning Operations) is a set of practices that combines Machine Learning, DevOps, and Data Engineering. It aims to deploy and maintain ML systems in production reliably and efficiently.
Why is it essential?
- Bridge the Gap: It solves the "works on my machine" problem by standardizing the transition from development (Jupyter notebooks) to production.
- Scalability: Automates the deployment of thousands of models.
- Monitoring & Retraining: Models degrade over time (data drift). MLOps ensures continuous monitoring and triggers retraining when accuracy drops.
- Governance & Compliance: Tracks who trained the model, on what data, and which version is currently running (Version Control).
- Faster Time to Market: Reduces the time required to move models from experiment to active business value.
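As a hedged example of the governance and reproducibility points, the sketch below logs a training run with MLflow so that parameters, metrics, and the model artifact are recorded and versioned. The toy dataset, run name, and hyperparameter are illustrative assumptions.

```python
import mlflow
import mlflow.sklearn
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Toy data standing in for the real training set
X, y = make_classification(n_samples=500, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

with mlflow.start_run(run_name="rf_baseline"):   # hypothetical run name
    model = RandomForestClassifier(n_estimators=100, random_state=42)
    model.fit(X_train, y_train)
    acc = accuracy_score(y_test, model.predict(X_test))

    # Everything below becomes part of the auditable run record
    mlflow.log_param("n_estimators", 100)
    mlflow.log_metric("test_accuracy", acc)
    mlflow.sklearn.log_model(model, "model")
```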
Explain the concept of Data Drift and Concept Drift in the context of Model Troubleshooting.
In AI model environments, performance often degrades after deployment because the real world changes. This is broadly categorized into drifts:
1. Data Drift (Covariate Shift):
- Definition: Occurs when the distribution of the input data, P(X), changes, but the relationship to the target variable, P(Y|X), remains the same.
- Example: An image recognition model trained on clear, high-resolution photos starts receiving blurry, low-light images from users. The model still knows what a "cat" looks like, but the input quality has shifted.
2. Concept Drift:
- Definition: Occurs when the statistical relationship between the input data (X) and the target variable (Y), i.e., P(Y|X), changes. The "concept" of what the model is predicting has evolved.
- Example: A fraud detection model based on spending habits. Pre-pandemic spending patterns are vastly different from post-pandemic patterns. A "normal" transaction in 2019 might look like "fraud" in 2020 due to changed consumer behavior.
Troubleshooting: Both require monitoring pipelines to detect statistical changes and retraining the model with new data.
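One common way to operationalize such monitoring is a statistical test comparing training and live feature distributions. The sketch below uses a two-sample Kolmogorov-Smirnov test from SciPy on a single numeric feature; the simulated data and the p-value threshold are illustrative assumptions.

```python
import numpy as np
from scipy.stats import ks_2samp

rng = np.random.default_rng(0)

# Feature values seen at training time vs. in production (simulated here)
train_feature = rng.normal(loc=0.0, scale=1.0, size=5_000)
live_feature = rng.normal(loc=0.7, scale=1.2, size=5_000)  # shifted distribution

statistic, p_value = ks_2samp(train_feature, live_feature)

# A small p-value suggests the input distribution has drifted
if p_value < 0.01:
    print(f"Data drift detected (KS statistic={statistic:.3f}, p={p_value:.1e})")
else:
    print("No significant drift detected")
```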
Describe the AI Project Lifecycle in detail.
The AI Project Lifecycle represents the end-to-end process of developing an AI solution. It consists of the following stages:
- Scoping (Problem Definition):
- Define the business problem.
- Determine feasibility and key metrics (e.g., accuracy, latency).
- Data Acquisition & Preparation:
- Collection: Gathering structured or unstructured data.
- Labeling: Annotating data (for Supervised Learning).
- Cleaning: Handling missing values and noise.
- Modeling:
- Feature Engineering: Selecting relevant variables.
- Training: Feeding data into algorithms (e.g., Neural Networks, Random Forest).
- Evaluation: Testing against a validation set using metrics like Precision, Recall, or F1-Score.
- Deployment:
- Moving the model to a production environment (Cloud, Edge, or On-premise).
- Integrating it via APIs for user access.
- Monitoring & Maintenance:
- Tracking performance for errors or drift.
- Retraining the model with new data to maintain relevance.
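To ground the Modeling and Evaluation stages, here is a minimal scikit-learn sketch that trains a classifier and reports precision, recall, and F1 on a validation split. The synthetic dataset stands in for the prepared data from the earlier stages.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split

# Stand-in for the cleaned, labeled dataset from the preparation stage
X, y = make_classification(n_samples=1_000, n_features=20, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2, random_state=0)

# Training
model = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# Evaluation against the validation set (precision, recall, F1)
print(classification_report(y_val, model.predict(X_val)))
```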
What are the common sources of error in AI models? How can they be identified?
Errors in AI models generally stem from data, the model itself, or the deployment environment.
1. Bias (Underfitting):
- Cause: The model is too simple to capture the underlying pattern.
- Identification: High error rate on both training and test data.
2. Variance (Overfitting):
- Cause: The model memorizes the noise in the training data rather than the pattern.
- Identification: Low error on training data but high error on test/validation data.
3. Data Leakage:
- Cause: Information that would not be available at prediction time (often derived from the target variable) is accidentally included in the training features (e.g., using "Future Sales" as a feature to predict "Current Sales").
- Identification: Suspiciously high accuracy (near 100%) during training/testing.
4. Data Quality Issues:
- Cause: Mislabeled data, missing values, or outliers.
- Identification: Exploratory Data Analysis (EDA) and visualization (box plots, histograms) to spot anomalies.
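The pandas sketch below shows the kind of quick EDA checks used to surface these problems: missing values, class imbalance, and features so correlated with the target that leakage is likely. The file name and the `label` column are assumptions.

```python
import pandas as pd

# Hypothetical training data with a 'label' target column
df = pd.read_csv("training_data.csv")

# Data quality: count missing values per column
print(df.isna().sum())

# Class imbalance: share of each label (e.g., 90% vs 10%)
print(df["label"].value_counts(normalize=True))

# Leakage hint: features almost perfectly correlated with the target
corr = df.corr(numeric_only=True)["label"].drop("label").abs()
print(corr.sort_values(ascending=False).head())
```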
Explain AI Process Automation and distinguish it from traditional RPA.
AI Process Automation, often called Intelligent Process Automation (IPA), combines traditional automation with AI technologies (Computer Vision, NLP, ML) to handle complex processes.
Distinction from RPA (Robotic Process Automation):
| Feature | RPA (Traditional) | AI Process Automation (IPA) |
|---|---|---|
| Nature | Rule-based. Follows strict "if-then" instructions. | Data-driven. Learns from patterns and improves over time. |
| Data Type | Handles structured data (spreadsheets, forms). | Handles unstructured data (emails, chats, scanned docs). |
| Flexibility | Rigid. Breaks if the user interface changes. | Adaptive. Can handle exceptions and variations. |
| Capabilities | Copy-pasting, scraping web data, moving files. | Sentiment analysis, image recognition, decision making. |
| Example | Automating invoice entry from a standard Excel sheet. | Reading a scanned PDF invoice, extracting fields, and deciding approval based on context. |
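To illustrate the IPA column, the sketch below pulls an invoice total out of unstructured OCR text with a regular expression and applies a simple approval rule. The sample text and the approval threshold are made up for the example; a real system would add NLP or layout-aware extraction.

```python
import re

# Text as it might come back from OCR on a scanned invoice (illustrative)
ocr_text = """
ACME Supplies Ltd.
Invoice No: 2024-0193
Total Amount Due: $1,245.50
Payment terms: Net 30
"""

match = re.search(r"Total Amount Due:\s*\$([\d,]+\.\d{2})", ocr_text)
if match:
    total = float(match.group(1).replace(",", ""))
    # Context-based decision: small invoices auto-approved, large ones escalated
    decision = "auto-approve" if total < 5_000 else "route to manager"
    print(f"Extracted total: {total:.2f} -> {decision}")
else:
    print("Could not extract the invoice total; flag for manual review")
```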
How do Cloud Services (AWS, Azure, Google Cloud) facilitate AI development? Provide examples of specific services.
Cloud providers offer "AI as a Service," reducing the barrier to entry by providing pre-trained models and managed infrastructure.
Facilitation Mechanisms:
- Managed Infrastructure: Users don't need to buy physical GPUs. They rent computing power (e.g., AWS EC2 P-instances).
- AutoML: Tools that automatically select the best algorithm for a dataset.
- Pre-trained APIs: Ready-to-use models for vision, speech, and text.
Examples of Services:
- AWS (Amazon Web Services):
- SageMaker: Full platform to build, train, and deploy models.
- Rekognition: Image and video analysis API.
- Microsoft Azure:
- Azure Machine Learning: Enterprise-grade ML service.
- Cognitive Services: APIs for Language, Speech, and Vision.
- Google Cloud Platform (GCP):
- Vertex AI: Unified ML platform.
- BigQuery ML: Allows running ML models directly inside SQL queries.
Discuss the challenges involved in processing Unstructured Data.
Unstructured data (text, video, audio) accounts for the majority of data generated but is difficult to process.
Key Challenges:
- Lack of Schema: There are no rows/columns. Data must be parsed to extract meaning (e.g., converting audio to text transcripts).
- High Volume & Storage: Video and audio files require massive storage space (Data Lakes) compared to text tables.
- Noise: Unstructured data contains significant noise (background sound in audio, typos in text, blurry frames in video) which hampers model accuracy.
- Complexity of Algorithms: Requires advanced Deep Learning techniques (CNNs for images, Transformers for text) which are computationally expensive.
- Labeling: Supervised learning requires labeled data. Manually labeling thousands of images or hours of audio is time-consuming and expensive.
Derive the need for Model Versioning and Data Versioning in the AI lifecycle.
In software engineering, version control (Git) is standard. In AI, versioning is more complex because an AI system consists of Code + Data + Model Parameters.
Need for Data Versioning:
- Data changes over time. If a model trained in January performs differently than one trained in June, engineers must be able to reproduce the exact dataset used in January to debug.
- Tools: DVC (Data Version Control).
Need for Model Versioning:
- Reproducibility: If a deployed model fails, one must be able to roll back to the previous working version.
- A/B Testing: Running two versions of a model simultaneously to see which performs better requires strict tracking.
- Audit Trails: For regulated industries (finance/healthcare), you must prove exactly which model made a specific decision.
Explain the concept of Bias-Variance Trade-off using mathematical intuition.
The Bias-Variance Trade-off is a fundamental problem in supervised learning that involves minimizing two sources of error to prevent underfitting and overfitting.
Total Error Equation:
Total Error = Bias² + Variance + Irreducible Error
- Bias (Error from erroneous assumptions):
- High Bias means the model is too simple (e.g., fitting a straight line to complex curved data).
- Result: Underfitting.
- Variance (Error from sensitivity to small fluctuations):
- High Variance means the model captures random noise in the training data.
- Result: Overfitting.
The Trade-off:
- As you increase model complexity (e.g., increasing degree of polynomial), Bias decreases (fits training data better), but Variance increases (generalizes poorly).
- Goal: Find the "sweet spot" where the sum of Bias and Variance is minimized, achieving the lowest total error on unseen data.
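The sketch below makes the trade-off visible by fitting polynomials of increasing degree to noisy data and comparing training versus validation error. The data-generating function and the chosen degrees are assumptions for illustration.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(42)
X = rng.uniform(-3, 3, size=(200, 1))
y = np.sin(X).ravel() + rng.normal(scale=0.3, size=200)  # noisy curved target

X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=42)

for degree in (1, 3, 12):
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X_train, y_train)
    train_err = mean_squared_error(y_train, model.predict(X_train))
    val_err = mean_squared_error(y_val, model.predict(X_val))
    # Degree 1: high bias; degree 12: high variance; degree 3: near the sweet spot
    print(f"degree={degree:2d}  train MSE={train_err:.3f}  val MSE={val_err:.3f}")
```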
What is Troubleshooting in the context of AI? List the steps to troubleshoot a model with low accuracy.
Troubleshooting is the systematic process of identifying, diagnosing, and resolving issues within an AI system.
Steps to troubleshoot low accuracy:
- Check Data Quality:
- Is the data labeled correctly?
- Are there missing values or unbalanced classes (e.g., 90% Cat, 10% Dog)?
- Review the Model Architecture:
- Is the model complex enough? (Check for High Bias).
- Is the model too complex? (Check for High Variance).
- Hyperparameter Tuning:
- Adjust learning rate, batch size, or regularization strength.
- Evaluate Metrics:
- Ensure the correct metric is used. Accuracy is misleading for imbalanced data; use F1-Score or AUC-ROC instead.
- Error Analysis:
- Manually examine the samples where the model failed. Is there a pattern? (e.g., model fails only on dark images).
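The short sketch below shows why accuracy misleads on imbalanced data: a classifier that always predicts the majority class scores 90% accuracy but an F1 of zero for the minority class. The 90/10 split mirrors the Cat/Dog example above.

```python
import numpy as np
from sklearn.metrics import accuracy_score, f1_score

# Imbalanced ground truth: 90% class 0 ("Cat"), 10% class 1 ("Dog")
y_true = np.array([0] * 90 + [1] * 10)

# A lazy model that always predicts the majority class
y_pred = np.zeros_like(y_true)

print("Accuracy:", accuracy_score(y_true, y_pred))        # 0.90, looks good
print("F1 (minority class):", f1_score(y_true, y_pred))   # 0.0, reveals the failure
```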
Elaborate on the significance of Automated Data Pipelines in modern organizations.
Manual data handling is slow and error-prone. Automated Data Pipelines enable the continuous flow of data from source to insight without human intervention.
Significance:
- Real-time Decision Making: Automation allows businesses to react to live data (e.g., stock market changes, fraud attempts) instantly.
- Scalability: Automated pipelines can handle sudden spikes in data volume (e.g., Black Friday sales) without crashing.
- Data Quality: Automated validation checks prevent bad data from entering the analytics system.
- Cost Efficiency: Reduces the need for manual data entry and cleaning teams.
- Consistency: Ensures that data transformations are applied uniformly every time, ensuring reliable reporting.
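As a minimal example of the automated validation point, the sketch below rejects a batch whose rows violate basic expectations before it is loaded downstream. The schema, column names, and rules are assumptions.

```python
import pandas as pd

def validate_batch(df: pd.DataFrame) -> list:
    """Return a list of validation errors; an empty list means the batch passes."""
    errors = []
    if df["order_id"].isna().any():
        errors.append("missing order_id values")
    if (df["amount"] < 0).any():
        errors.append("negative amounts")
    if not df["order_date"].between(pd.Timestamp("2020-01-01"), pd.Timestamp.today()).all():
        errors.append("order_date outside the expected range")
    return errors

# Illustrative incoming batch with two bad rows
batch = pd.DataFrame({
    "order_id": [1, 2, None],
    "amount": [19.99, -5.00, 42.00],
    "order_date": pd.to_datetime(["2024-03-01", "2024-03-02", "2024-03-02"]),
})

problems = validate_batch(batch)
print("Load batch" if not problems else f"Reject batch: {problems}")
```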
How does Edge AI address privacy and bandwidth concerns? Give a real-world scenario.
Edge AI involves running AI algorithms locally on a hardware device (the "Edge") rather than sending data to a centralized cloud.
Addressing Concerns:
- Privacy:
- Since data is processed on the device, personal information (images, voice) never leaves the user's possession. This reduces the risk of data breaches during transmission or cloud storage.
- Bandwidth:
- Streaming high-definition video to the cloud 24/7 consumes massive bandwidth. Edge AI processes the video locally and only sends metadata (e.g., "Intruder detected") to the cloud.
Real-world Scenario: Smart Security Camera
- Instead of uploading 24 hours of video footage to the cloud, the camera uses Edge AI to detect motion. If it sees a person, it sends a 10-second clip and an alert to the user's phone. This saves bandwidth and ensures the neighbors' privacy isn't violated by constant cloud recording.
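The OpenCV sketch below captures the core of this scenario with simple frame differencing: frames are analysed on the device and only an alert would be sent upstream. The camera index, threshold, and frame limit are assumptions, and person detection is reduced to motion detection for brevity.

```python
import cv2

cap = cv2.VideoCapture(0)          # on-device camera (index is an assumption)
ret, prev = cap.read()
prev_gray = cv2.cvtColor(prev, cv2.COLOR_BGR2GRAY)

for _ in range(300):               # process a bounded number of frames in this sketch
    ret, frame = cap.read()
    if not ret:
        break
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)

    # Pixel-wise difference between consecutive frames = cheap motion signal
    diff = cv2.absdiff(prev_gray, gray)
    motion_score = (diff > 25).mean()

    if motion_score > 0.05:        # illustrative threshold
        # In the real product this would trigger a short clip upload + push alert
        print("Motion detected -- send alert and 10-second clip to the cloud")

    prev_gray = gray

cap.release()
```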
Discuss Data Visualization principles that ensure AI insights are communicated effectively.
Effective visualization turns complex AI outputs into actionable insights.
Key Principles:
- Simplicity (Clarity): Avoid "chart junk" (excessive grid lines, 3D effects). The message should be immediately apparent.
- Choose the Right Chart:
- Comparison: Bar Chart.
- Trend: Line Chart.
- Distribution: Histogram.
- Correlation: Scatter Plot.
- Context: AI predictions (e.g., "Sales: 500") are useless without context. Visualize against targets or historical averages.
- Color Usage: Use color to highlight important data points (e.g., red for 'churn risk'), not just for decoration.
- Trust: When visualizing AI predictions, show confidence intervals (e.g., "Predicted demand: 100 ± 5") to manage user expectations.
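To illustrate the Trust principle, the sketch below plots a forecast with a shaded confidence band using Matplotlib. The forecast values and the ±5 band are invented for the example.

```python
import numpy as np
import matplotlib.pyplot as plt

days = np.arange(1, 11)
forecast = 100 + 2 * days                    # hypothetical predicted demand
lower, upper = forecast - 5, forecast + 5    # +/-5 confidence band

plt.plot(days, forecast, label="Predicted demand")
plt.fill_between(days, lower, upper, alpha=0.3, label="Confidence interval")
plt.xlabel("Day")
plt.ylabel("Units")
plt.title("Forecast with uncertainty band")
plt.legend()
plt.show()
```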
Explain the Deployment phase of the AI Lifecycle. What are the different deployment strategies?
Deployment is the stage where a trained ML model is integrated into a production environment to make predictions on live data.
Deployment Strategies:
- Batch Prediction:
- The model runs periodically (e.g., every night) on a large batch of data. (Example: Generating daily churn reports).
- Real-time (Online) Prediction:
- The model is exposed as a REST API. It receives a request and returns a prediction instantly. (Example: Uber estimating arrival time).
- A/B Testing:
- Deploying two models (A and B) to different subsets of users to compare performance before full rollout.
- Canary Deployment:
- Rolling out the model to a small percentage of users (e.g., 5%) first. If no errors occur, the rollout is gradually increased to 100%.
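For the Real-time (Online) strategy, the sketch below wraps a trained model in a minimal Flask REST endpoint. The model file name and the JSON payload format are placeholders, and authentication and input validation are omitted for brevity.

```python
import joblib
import numpy as np
from flask import Flask, jsonify, request

app = Flask(__name__)
model = joblib.load("model.joblib")   # hypothetical trained model artifact

@app.route("/predict", methods=["POST"])
def predict():
    # Expect JSON like {"features": [1.2, 3.4, 5.6]}
    features = np.array(request.json["features"]).reshape(1, -1)
    prediction = model.predict(features)
    return jsonify({"prediction": prediction.tolist()})

if __name__ == "__main__":
    app.run(port=8000)
```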
What is Feature Engineering and why is it considered a crucial step in data analysis for AI?
Feature Engineering is the process of using domain knowledge to extract features (characteristics, properties, attributes) from raw data.
Process:
- It involves transforming raw data into formats that are suitable for machine learning algorithms.
- Example: Creating a "BMI" feature from "Height" and "Weight" columns.
Importance:
- Improves Accuracy: Good features expose the underlying structure of the data better than raw data, leading to better model performance.
- Reduces Complexity: It allows models to be simpler and faster by removing irrelevant data.
- Handles Unstructured Data: Algorithms cannot understand text directly. Feature engineering (like TF-IDF or Word Embeddings) converts text into numerical vectors that models can process.
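The sketch below shows both flavours mentioned above: deriving a BMI column from raw height and weight with pandas, and turning unstructured text into numerical TF-IDF vectors with scikit-learn. The column names and sample reviews are assumptions.

```python
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer

# Numeric feature engineering: derive BMI from raw height/weight columns
patients = pd.DataFrame({"height_m": [1.70, 1.82], "weight_kg": [68, 95]})
patients["bmi"] = patients["weight_kg"] / patients["height_m"] ** 2
print(patients)

# Text feature engineering: convert unstructured text into TF-IDF vectors
reviews = ["great product, fast delivery", "poor quality, slow delivery"]
vectorizer = TfidfVectorizer()
X_text = vectorizer.fit_transform(reviews)
print(X_text.shape)                         # (2 documents, vocabulary-size features)
print(vectorizer.get_feature_names_out())
```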