Shefali Singhal

Davis, CA

(530) 848 5602

shefali.singhal@gmail.com

http://wwwcsif.cs.ucdavis.edu/~singhals/

 

Objective

To obtain a challenging R&D/Scientist position.

Skills & Achievements

·        Data Analysis of streaming Traffic Data.

·        Modeling and Analysis of Gigabytes/Terabytes of streaming geospatial data.

·        Estimation and cost models for processing of geospatial data.

·        Estimation/Forecasting using Time Series Analysis

·        Complexity analysis: Identification of problem complexity, NP complete, NP hard, etc.

·        Developing optimizations algorithms: Linear Programming, Dynamic Programming, Search Trees, heuristics

·        Distributed computing: routing, task allocation, resource management, constraint analysis, transaction management, concurrent programming

 

 

Software Skill-Set

Languages:                                C, C++, Java, C#, Visual Basic .Net, SQL, PL/SQL, XML, DTD.

Operating Systems:                   Windows, Linux, Unix, MS-DOS

Databases:                                Oracle, SQL Server, Microsoft Access, Postgres, MySQL

Tools/IDE:                                Visual Studio .Net, JBuilder, UML, Visio, Rational Rose

Others:                                      Multi-threaded Programming, Sockets, TCP/IP, Web Servers

Web/GUI Tools:                       HTML, Javascript, MS Front Page.

 Scripting:                                 Bash scripts, Perl, Python

Very quick at picking up new languages/technologies/researching new areas.

 

Education

Ph.D. in Computer Science (Since Fall 2002)                             GPA: 3.83

University of California, Davis, CA

Major: Database Systems

Minor: Theory

Dissertation: Distributed processing of geospatial image data

(member of GeoStreams Project – http://geostreams.ucdavis.edu/ )

Doctoral Candidate: since Nov. 2005

Expected Graduation:

Sep. 2006

·          Developed data and query model for streaming geospatial image data received from satellites.

·          Real-time processing of Terabytes of data/day .

·          Developed a cluster-based stream processing framework and formalized the operator scheduling problem/identified its complexity class.

·          Developed operator scheduling algorithms for data stream processing systems using heuristics.

 

MS in Computer Science (Honors)                                             GPA: 3.95

Oklahoma City University, Oklahoma City, OK

Major: Database Systems

 

Dec’ 2000

BE in Production Engineering,                                                    GPA: 7.88/10

Sardar Patel University, Gujarat, India

Senior Project:

         Optimization of a production line using Linear Programming.

 

1996

Research

My current research interests are in the areas of:

·     Algorithms/Problem Complexity/Optimizations

·     Databases and Query Processing

·     Distributed Computing

 

·     Data modeling

·     Scientific data modeling and management

·     System Design

Academic Positions

Graduate Student Researcher

·     Development of distributed computing framework for streaming geospatial image data received from satellites.

·     Data/Query models, cost models, scheduling algorithms.

·     Optimizations : Dynamic Programming, Linear Programming, Heuristics, Time Series Analysis

·     Development of a Java-based Simulator to simulate distributed processing of streaming image data.

·     Parsing image data stream received from GOES satellite and generating a stream of PNG images.

 

2003 – present

Teaching Assistant :Database Systems, UC, Davis

Teaching Assistant :Advanced Database Systems, UC, Davis

Lab Assistant, Okalahoma City University

 

Winter  2002

Spring  2003

Fall  2000

Work Experience

Data Mining/Traffic Analyst (R&D), Dash Navigation, Mountain View, CA

·     Responsible for design and development of tools for data mining, data quality evaluation and spatial/temporal data analysis of traffic data and outlier detection algorithms.

·     Tools employed:  MySQL, SQL, Java, Perl, Bash scripts.

 

Since Nov. 2006

Design Engineer, Larsen & Toubro Ltd (India’s largest engineering & construction conglomerate)

·     Responsible for design, development, testing and deployment of software packages to automate the design of a power plant. Some of the equipments designed included conveyors, cranes, lifts, etc.

·     Tools employed :  Oracle, Visual Basic, C/C++, Windows, SQL, PL/SQL

·     Principle lead for preparing design document, technical bid analysis and design approval for several power plant equipments.

1996-1999

Memberships & Affiliations

 

·     Member of the Program Committee: 2003 Student Workshop on Computing (CS Department, UCD.)

·     Student Member of ACM

·     Student Member of IEEE

·     Member of a LSIP (Large Scale Interaction Process) Team at L&T-Sargent and Lundy.

o Helped develop programs to improve management and staff interaction/communication.

o Developed and organized a seminar series to promote learning among staff.

 

 

Publications

 

 

·        Michael Gertz, Quinn Hart, Carlos Rueda, Shefali Singhal, and Jie Zhang: A Data and Query Model for Streaming Geospatial Image Data. To appear in Query Languages and Query Processing, 11th International Workshop on Foundations of Models and Languages for Data and Objects (FMLDO), 2006

            Environmental monitoring satellites stream terabytes of image data to Earth each day. This paper introduces the data and query model for real time processing of streaming image data received from satellites.

·        Shefali Singhal, Michael Gertz: Operator Scheduling in a Distributed Stream Management System for Remotely-Sensed Imagery. Technical Report CSE-2005-22, October 2005.

Real-time processing petabytes/day of raster image data from multiple satellites requires a distributed computing environment. In a distributed computing environment, task (operator) allocation to nodes is essential for efficient resource management and performance improvement. This paper presents the problem complexity analysis and optimization algorithms for this problem.

·        Shefali Singhal, Michael Gertz: Spatially-Aware Operator Scheduling for RSI Data Streams. To be submitted for conference publication.

Exploiting the spatial extents of both incoming images and user queries gives rise to new operator scheduling techniques. In this paper, we introduce a spatially aware, heuristic operator scheduling algorithm, which aims at minimizing resource consumption (optimization problem) and reducing data processing latency.

·        Shefali Singhal, Michael Gertz: DiGSim: Distributed GeoStreams Simulator . To be submitted for conference publication.

DiGS: Distributed GeoStreams Simulator, is a Java-based simulator to test and develop scheduling algorithms for operator scheduling in a distributed processing environment for remotely-sensed image (RSI) data streams. Simulation allows us to model a distributed computing framework and analyze our algorithms and processing techniques.

 

 

Graduate Course Work

 

Design and Analysis of Algorithms

Modern Cryptography

Computer and Information Security

Logics and Knowledge Representation

Advanced Database Systems

Distributed Database Systems

Operation System Models

Advanced Computer Architecture

Programming Languages

Computer Networks

Internet Quality of Service

Wide-Area Distributed Information Systems