INTRODUCTION
Hydrologic modeling is very important in many water resources planning design and management activities (Jain and Srinivasulu, 2004). Forecasting of stream flow has been one of the important problems for hydrologists, reservoirs operators and flood protection engineers. In this connection, the relationship between rainfall and runoff has been widely studied in many conceptual rainfall runoff models (Danh et al., 1999). The RainfallRunoff process is believed to be highly nonlinear time varying, spatially distributed and not easily described by simple models. RainfallRunoff process consists of the movement of rainfall through different media and its transformation to the runoff in channels either natural or manmade (Phien and Danh, 1997). Two major approaches for modeling the RainfallRunoff process have been explored in literature: conceptual (physical) modeling and system theoretic modeling (black box). Conceptual RainfallRunoff (CRR) models are designed to approximate within their structures the general internal sub processes and physical mechanisms which govern the hydrologic cycle (Hsu et al., 1995). While such models ignore the spatially distributed, timevarying and stochastic properties of the RainfallRunoff process, they attempt to incorporate realistic representation of the major nonlinearities inherent in the RainfallRunoff relationships. Also conceptual models are of importance in the understanding of hydrologic processes; there are many practical situations such as stream flow forecasting where the main concern is with making accurate predictions at specific watershed locations. In such a situation, a hydrologist may prefer not to expend the time and effort required to develop and implement a conceptual model and instead implements a simpler system theoretic model. In the system theoretic approach, difference equation or differential equation models are used to identify a direct mapping between the input and output without detailed consideration of the internal structure of physical processes (Shamseldin, 1997).
Since the early nineties, Artificial Neural Networks (ANNs) have been successfully used in hydrology related areas such as RainfallRunoff modeling, stream flow forecasting, groundwater modeling, water quality, water management policy, precipitation forecasting, hydrologic time series and reservoir operations (Govindaraju, 2000). ANNs have been employed as alternative tools in developing nonlinear system theoretic models of the hydrological processes. An ANN is a nonlinear mathematical structure which is capable of representation arbitrarily complex nonlinear processes that relate the input and outputs of any system. Hydrologists have been exploring ANNs for more then 10 years (Ozgur, 2004). In general the advantages of ANNs over other statistical and conceptual models are:
• 
The application of ANNs does not require a prior knowledge
of the process because ANNs have blackbox properties. 
• 
ANNs have the inherent property of nonlinearity since neurons
activate a nonlinear filter called an activation function. 
• 
ANNs can have multiple input having different characteristics,
which can make ANNs able to represent the timespace variability. 
• 
ANNs have the adaptability to represent change of problem
environments (Kim and Valdes, 2003). 
However training process of ANNs requires significant amounts of data so that the patterns embedded in the system are discovered. In statistical time series forecasting, decomposition approaches seek to decompose a time series into its major sub components. Lately, Back Propagation Artificial Neural Networks (BPANNs), a particular type of neural network, have been developed and successfully used in many fields (Gorr et al., 1994; Lachtermacher and Fuller, 1995; Maier and Dandy, 1996).
Recently wavelet transform have become a common tool for analyzing local variation in time series (Kim and Valdes, 2003). These now classic approaches to ANN RR modeling combine information at various frequency scales, where the hydrological process consist of a superposition of many sources and results in the limitation that the underlying system switches between these different hydrologic sources, producing different dynamics (Anctil and Tape, 2004).
The objective of this study is to explore a conjunction model performance of a neuralwavelet hybrid system in order to RainfallRunoff forecasting. In this research, we present two types of models for daily RainfallRunoff process. The first type of models employ ANNs technique using total rainfall and discharge data. The second type of model use a conjugate method with using wavelet and ANNs that called neural wavelet networks (NWNs) or wavelet.
ARTIFICIAL NEURAL NETWORKS
The application of ANNs has been the topic of large number of paper. Use of neural network techniques to solve hydrologic engineering problem began in the late 1992. As shown in Fig. 1, threelayered feed forward neural networks which have been usually used in forecasting hydrologic time series, provide a general framework for representing nonlinear functional mapping between a set of input and out variables (Kim and Valdes, 2003).
The output is
determined by the architecture of the networks. The number of hidden layer which
serve as links between the input and output layers determined with trial and
error rule. Each input or signal x_{i} = (i = 1,....,n) is attenuated
or amplified by a factor w_{ji} the explicit expression for an output
value of ANN is given by following Equation.

Fig. 1: 
Typical threelayered feed forward neural networks with back
propagation algorithm 
Where:
w_{ji} 
= 
Weight in the hidden layer connecting the ith neuron in the
layer 
J_{th} 
= 
Neuron in the hidden neuron 
f_{h} 
= 
Activation function of the hidden neuron 
w_{kj} 
= 
Weight in the output layer connecting the jth neuron in the hidden
layer 
K_{th} 
= 
Neuron in the output layer 
b_{k} 
= 
Bias for the K_{th} output neuron 
f_{o} 
= 
Activation function for the output neuron 
The values of weights are different in the layers and can be updated during the process of network training. The activation function f_{h} that will be used is the log sigmoid function given by:
We need to determine the optimum of weights and biases that will yield the
least mean square value of the desired response ,
thus we must satisfy the following performance criterion:
where, E is statistical expectation operator and the factor 1.2 is included for convenience of Presentation (Birkunavyi, 2002).
NEURAL WAVELET NETWORK
The term wavelet as it implies means a little wave. This little wave must have at least a minimum oscillation and a fast decay to zero, in both the positive and negative directions, of its amplitude. This property is analogous to an admissibility condition of a function that is required for the wavelet transform (Thuillard, 2000). Sets of wavelets are employed to approximate a signal and the goal is to find a set of daughter wavelets constructed by a dilated and translated original wavelets or mother wavelets that best represent the signal. The daughter wavelets are generated from a single mother wavelet h (t) by dilation and translation:
where, a>0 is the dilation factor, b is the translation factor and c is correction factor (Lekutai, 1977).
Neural Wavelet networks employing wavelets as the activation functions recently have been researched as an alternative approach to the neural networks with sigmoid activation functions. The combination of wavelet theory and neural networks has lead to the development of wavelet networks. Wavelet networks are feed forward neural networks using wavelets as activation function. In wavelet etworks, both the position and the dilation of the wavelets are optimized besides the weights. Wavelet is another term to describe wavelet networks. Originally, Neural Wavelet networks did refer to neural networks using wavelets. In NWN, the position and dilation of the wavelets are fixed and the weights are optimized (Thuillard, 2000).
NWN BACK PROPAGATION (NWNBP)
Back Propagation (BP) neural network is now the most popular mapping neural network. But BP neural network has few problems such as trapping into local minima and slow convergence. Wavelets are a powerful tool signal analysis. They can approximately realize the timefrequency analysis using a mother wavelet. The mother wavelet has a square window in the timefrequency space. The size of the window can be freely variable by two parameters. Thus, wavelets can identify the localization of unknown signals at any level. Activation function of hidden layer neurons in BackPropagation network is a sigmoidal function shown in Fig. 2a. This type of activation function provides a global approximation on the search space. In this study we have substituted hidden layer sigmoidal activation function of Back Propagation neural network with POLYWOG and other wavelets.
Diagram of POLYWOG1 with a = 1 and b = 0, is shown in Fig. 1b.
This type of activation function provides a local approximation to the experimental data. In Back Propagation NWN (BPW), the position and dilation of the wavelets as activation function of hidden layer neurons are fixed and the weights of network are optimized using Scaled Conjugate Gradient (SCG) algorithm. In this study we suppose a = 1 and b = 0.2, 2.5,10

Fig. 2: 
(a) Sigmoidal function and (b) POLYWOG mother wavelet 
Therefore, BPW is a modified BackPropagation neural network with local approximation property and POLYWOG1 hidden layer neurons activation function. And adjusting the weights of network are done using Scaled Conjugate Gradient (SCG) algorithm. Structure of BPW is shown in Fig. 3.
PREPARATION OF DATA
First, time series data are divided to two sets; 80% for training and 20% for testing. Then we normalized data prepared the files associated MATLAB software for using them in wavelet neural network and neural network.
In this study, based on correlation analysis, proper input variables are selected from a set of potential inputs.
CASE STUDY APPLICATION
A multilayer artificial neural networks and a neurowavelet hybrid system were used. The proposed conjunction model based on use of wavelet transform and artificial neural networks for predicting runoff hydrograph has been applied to Halil River in Jiroft Dam located in the South of Iran (Fig. 4).
The Halil River watershed has five rainfall stations named BaftSoltani, BaftZirpol,
Meidan, Henjan and Cheshmeharoos. The stream flow measurement station named
Konaroeih which is close to Jiroft Dam, where natural flow measurements are
available, has been selected as a site to estimate runoff hydrograph. The rainfall
information of the stations, were chosen as input data. Therefore, hourly rainfall
from a gauging station of five stations (BaftSoltani, BaftZirpol, Meidan,
Henjan, Cheshmeharoos) are used as input data and hourly discharge from a gauging
station Konaroeih as output data for the artificial neural networks and a neurowavelet
hybrid approach. Using the input and output data to train the artificial neural
network model by trail and error procedure for different number of hidden layer(s),
it was concluded that the model with appropriate 781 structures has the best
topology.

Fig. 4: 
Halil River watershed and location of the rainfall and hydrometrics
stations 

Fig. 5: 
Comparison between calculated and observed discharge, flood
date December 8 through 17, 1376 

Fig. 6: 
Rainfall as an input information of the 5 station, flood
date December 8 through 17, 1376 
The model results and the actual data of the stream flow discharge of Konaroeih
station in (for
two flood date December 8 through 17, 1376 and December 25, 1376 through January
5, 1377) are shown in Fig. 58.

Fig. 7: 
Comparison between calculated and observed discharge, flood
date December 25, 1376 through January 5, 1377 

Fig. 8: 
Rainfall as an input information of the five stations, flood
date December 25, 1376 through January 5, 1377 
RESULTS
As shown in Table 1 different wave exert with various delay
and transmission. The considerable point is that the more coefficient related
to transmission was smaller, the better result have obtained. Of course we should
say this problem is not a general real.
Table 1: 
Results of the wavelet models 

Table 2: 
Comparison of the results between the ANN mode and wavelet
model 

Analysis with wavelet that is considered as an proposed of this research,
register of time record in train phase that this fact see when we use the software
but with attention to the probable changes in software and hardware situation,
this investigation needs excellence situation.
CONCLUSION
The use of neural wavelet network is a new method in improving the result of neural networks. In this paper use a multilayer network structure to predict daily stream flow. With the operation of NWN in Table 2 also compares the yield result with observance can say: NWN can be become a good alternative for ANN in predict daily stream flow. Suggested that other wavelet with large extend exert in changing the translation and dilation parameter and also NWN can uses in the other result science in water resources.