|
Shefali Singhal |
|||||||
|
|
|||||||
|
(530) 848 5602 |
|||||||
|
http://wwwcsif.cs.ucdavis.edu/~singhals/
|
|||||||
Objective |
To obtain a challenging R&D/Scientist position. |
||||||
·
Data
Analysis of streaming Traffic Data. · Modeling and Analysis of Gigabytes/Terabytes of streaming geospatial data. · Estimation and cost models for processing of geospatial data. · Estimation/Forecasting using Time Series Analysis · Complexity analysis: Identification of problem complexity, NP complete, NP hard, etc. · Developing optimizations algorithms: Linear Programming, Dynamic Programming, Search Trees, heuristics ·
Distributed
computing: routing, task allocation, resource management, constraint
analysis, transaction management, concurrent programming
|
|||||||
Languages: C, C++, Java, C#, Visual Basic .Net, SQL, PL/SQL, XML, DTD. Operating Systems: Windows, Linux, Unix, MS-DOS Databases: Oracle, SQL Server, Microsoft Access, Postgres, MySQL Tools/IDE: Visual Studio .Net, JBuilder, UML, Visio, Rational Rose Others: Multi-threaded Programming, Sockets, TCP/IP, Web Servers Web/GUI Tools: HTML, Javascript, MS Front Page. Scripting: Bash scripts, Perl, Python Very quick at picking up new languages/technologies/researching new areas.
|
|||||||
Education |
|||||||
|
Ph.D. in Computer Science (Since Fall 2002) GPA: 3.83 Major: Database Systems Minor: Theory Dissertation: Distributed processing of geospatial image data (member of GeoStreams Project – http://geostreams.ucdavis.edu/ ) |
Doctoral Candidate: since Nov. 2005 Expected Graduation: Sep. 2006 |
||||||
|
· Developed data and query model for streaming geospatial image data received from satellites. · Real-time processing of Terabytes of data/day . · Developed a cluster-based stream processing framework and formalized the operator scheduling problem/identified its complexity class. · Developed operator scheduling algorithms for data stream processing systems using heuristics.
|
|||||||
|
MS in Computer Science (Honors) GPA: 3.95 Major: Database Systems
|
Dec’ 2000 |
||||||
|
BE in Production Engineering, GPA: 7.88/10 Senior Project: Optimization of a production line using Linear Programming.
|
1996 |
||||||
ResearchMy current research interests are in the areas of:
|
|||||||
Academic Positions |
|||||||
|
Graduate Student Researcher · Development of distributed computing framework for streaming geospatial image data received from satellites. · Data/Query models, cost models, scheduling algorithms. · Optimizations : Dynamic Programming, Linear Programming, Heuristics, Time Series Analysis · Development of a Java-based Simulator to simulate distributed processing of streaming image data. · Parsing image data stream received from GOES satellite and generating a stream of PNG images.
|
2003 – present |
||||||
|
Teaching Assistant
:Database Systems, UC,
Teaching Assistant
:Advanced Database Systems, UC, Lab
Assistant,
|
Winter 2002 Spring 2003 Fall 2000 |
||||||
Work Experience |
|||||||
|
Data
Mining/Traffic Analyst (R&D), Dash Navigation,
· Responsible for design and development of tools for data mining, data quality evaluation and spatial/temporal data analysis of traffic data and outlier detection algorithms. · Tools employed: MySQL, SQL, Java, Perl, Bash scripts. |
Since Nov. 2006 |
||||||
|
Design Engineer, Larsen & Toubro Ltd ( · Responsible for design, development, testing and deployment of software packages to automate the design of a power plant. Some of the equipments designed included conveyors, cranes, lifts, etc. · Tools employed : Oracle, Visual Basic, C/C++, Windows, SQL, PL/SQL · Principle lead for preparing design document, technical bid analysis and design approval for several power plant equipments. |
1996-1999 |
||||||
Memberships & Affiliations |
|
||||||
|
· Member of the Program Committee: 2003 Student Workshop on Computing (CS Department, UCD.) · Student Member of ACM · Student Member of IEEE · Member of a LSIP (Large Scale Interaction Process) Team at L&T-Sargent and Lundy. o Helped develop programs to improve management and staff interaction/communication. o Developed and organized a seminar series to promote learning among staff.
|
|
||||||
Publications
|
|
||||||
|
· Michael Gertz, Quinn Hart, Carlos Rueda, Shefali Singhal, and Jie Zhang: A Data and Query Model for Streaming Geospatial Image Data. To appear in Query Languages and Query Processing, 11th International Workshop on Foundations of Models and Languages for Data and Objects (FMLDO), 2006 Environmental monitoring satellites stream terabytes of image data to Earth each day. This paper introduces the data and query model for real time processing of streaming image data received from satellites. · Shefali Singhal, Michael Gertz: Operator Scheduling in a Distributed Stream Management System for Remotely-Sensed Imagery. Technical Report CSE-2005-22, October 2005. Real-time processing petabytes/day of raster image data from multiple satellites requires a distributed computing environment. In a distributed computing environment, task (operator) allocation to nodes is essential for efficient resource management and performance improvement. This paper presents the problem complexity analysis and optimization algorithms for this problem. · Shefali Singhal, Michael Gertz: Spatially-Aware Operator Scheduling for RSI Data Streams. To be submitted for conference publication. Exploiting the spatial extents of both incoming images and user queries gives rise to new operator scheduling techniques. In this paper, we introduce a spatially aware, heuristic operator scheduling algorithm, which aims at minimizing resource consumption (optimization problem) and reducing data processing latency. · Shefali Singhal, Michael Gertz: DiGSim: Distributed GeoStreams Simulator . To be submitted for conference publication. DiGS: Distributed GeoStreams Simulator, is a Java-based simulator to test and develop scheduling algorithms for operator scheduling in a distributed processing environment for remotely-sensed image (RSI) data streams. Simulation allows us to model a distributed computing framework and analyze our algorithms and processing techniques.
|
|
||||||
Graduate Course Work |
|
||||||
|
Design and Analysis of Algorithms Modern Cryptography Computer and Information Security Logics and Knowledge Representation Advanced Database Systems Distributed Database Systems |
Operation System Models Advanced Computer Architecture Programming Languages Computer Networks Internet Quality of Service Wide-Area Distributed Information Systems |
|
|||||
|
|
|
|
|
|
|
||