Foundational Data Automation
Tata Consultancy Services / WNS
Analyst Programmer

The Narrative
"Eliminate human error and maximise efficiency in high-stakes, multi-client data analysis environments — turning what was a slow, manual, error-prone process into a reliable, automated pipeline that could scale across millions of records and dozens of simultaneous project streams."
Joined TCS/WNS as a junior analyst programmer and immediately took on a portfolio spanning multiple clients and concurrent projects: market research datasets and medical questionnaire analysis, both with rigorous validation standards.
Developed a reusable suite of shell scripts that automated the full pipeline: raw data ingestion, schema-specific cleaning and transformation, multi-dimensional validation, and formatted output delivery. Scripts were parameterised by client schema, so a single pipeline codebase served every account. Processing ran asynchronously overnight, freeing working hours for exception handling and quality verification rather than manual data entry.
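The clean-validate-deliver stages described above can be sketched in portable shell. The record layout, field names, and sample data below are illustrative stand-ins, not the original client schemas:

```shell
#!/bin/sh
# Sketch of a clean -> validate pipeline stage (hypothetical schema: 3 fields).
set -eu

clean() {
    # Normalise line endings, trim trailing whitespace, drop blank lines.
    tr -d '\r' | sed -e 's/[[:space:]]*$//' -e '/^[[:space:]]*$/d'
}

validate() {
    # Pass through only records with the expected field count; report rejects.
    awk -F',' -v n="$1" 'NF == n { print; next } { bad++ }
        END { if (bad) print bad " rejected" > "/dev/stderr" }'
}

# Sample raw input standing in for a client extract (one malformed record).
printf 'id,age,answer\r\n101,34,yes\r\n\r\n102,29\r\n103,41,no\r\n' \
    | clean | validate 3
# Valid records reach stdout; "1 rejected" is reported on stderr.
```

Chaining small, single-purpose functions like this keeps each stage independently testable, which matters when the same codebase must serve many accounts.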
Within months, productivity metrics ranked consistently at the top of the team, a pace sustained for five consecutive months. The automation patterns built here were adopted as the team standard. Before the first year was complete, transitioned from individual contributor to leading a team, applying the same systematic thinking that drove personal output to coordinating and growing a broader group.
Achieved the highest productivity hours in the organisation for five consecutive months while concurrently managing multiple client accounts and project streams with millions of records. Promoted to team lead within the first year.
Automation pipelines reduced error rates from 15% to 0.2% and cut report turnaround from days to minutes — establishing a new benchmark for data processing reliability that the wider team adopted as standard practice.
Strategic Decisions
Shell Scripting for High-Volume Data Pipelines
Repetitive, manual analysis of millions of records across market research and medical questionnaire datasets was prone to human error, took days per cycle, and did not scale across multiple simultaneous client accounts.
Developed a suite of reusable shell scripts to automate the cleaning, transformation, validation, and ingestion of raw data, parameterised per client schema so the same pipeline could service multiple accounts without duplication.
Turnaround time for major reports decreased from days to minutes, enabling simultaneous delivery across multiple client projects with 100% data consistency.
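One way the per-client parameterisation might look, with invented client keys and field counts standing in for the real per-account configuration:

```shell
#!/bin/sh
# Sketch of schema parameterisation: one pipeline codebase, per-client lookup.
# Client names and field counts are invented placeholders.
set -eu

schema_fields() {
    # Map a client key to its expected field count; a real deployment would
    # read this from a per-client config file rather than an inline table.
    case "$1" in
        acme_market)  echo 12 ;;
        medco_survey) echo 48 ;;
        *) echo "unknown client: $1" >&2; return 1 ;;
    esac
}

run_pipeline() {
    client="$1"; raw="$2"
    fields=$(schema_fields "$client")
    # Same codebase for every account: only the parameters change.
    awk -F',' -v n="$fields" 'NF == n' "$raw" > "${client}.validated.csv"
}
```

Driving every account through one entry point means a fix or improvement lands everywhere at once, which is the property that let the pattern become a team standard.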
Multi-Client Parallel Project Management
Managing deliverables across multiple clients and project streams simultaneously created scheduling conflicts and the risk that quality would degrade under volume pressure.
Built a personal workflow system that batched and sequenced tasks by client SLA priority, with automated pipeline runs scheduled overnight so analyst time was reserved for validation and exception handling rather than raw processing.
Sustained the highest productivity hours in the organisation for five consecutive months across all active client accounts, with no missed deadlines and no client escalations.
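A minimal sketch of the SLA-priority batching idea, with hypothetical client names and priority numbers; the actual sequencing rules were the author's own:

```shell
#!/bin/sh
# Sketch of an overnight batch runner: jobs tagged with an SLA priority run in
# order, so the tightest deadlines are processed first. Names are illustrative.
set -eu

# "priority client" pairs; lower number = tighter SLA.
JOBS='2 acme_market
1 medco_survey
3 retail_panel'

echo "$JOBS" | sort -n | while read -r prio client; do
    echo "running ${client} (priority ${prio})"
    # A real run would invoke the client pipeline here, e.g.:
    # ./pipeline.sh "$client" "inbox/${client}.raw"
done
```

Triggering the batch overnight is then a single scheduler entry, e.g. a crontab line such as `0 1 * * * /opt/pipelines/run_batch.sh` (path illustrative).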
© 2026 Wenceslaus Dsilva