|
Description  |
|
|
TECHNICAL FIELD
The present invention relates generally to high density random access
memory (RAM) semiconductor circuits, and more particularly to clock
circuits therefor.
BACKGROUND OF THE INVENTION
Synchronous random access memories (RAMs) are commonly employed in computer
systems to operate as cache RAM for a microprocessor. Synchronous RAMs
(usually static RAMs or SRAMs) operate according to the timing provided by
a system clock. For example, read operations begin with an address being
latched on a first clock cycle. Once the address is latched, row and
column decoding functions are performed and data are presented to at an
input/output (I/O) latch where they are latched on a subsequent clock
cycle. Other synchronous functions include "burst" modes wherein,
according to control inputs provided, a range of column addresses can be
generated on every clock cycle following the initial latching of the
address. To accomplish synchronous operation, the system clock is received
by the RAM device and distributed to various circuits within, such as the
address latch and I/O latch described above.
While synchronous memories provide many advantages, increasing clock speeds
and advances in fabrication technology can adversely affect synchronous
memory operation. As is well known in the art, long-distance interconnects
and delays due to reduced geometry active devices can contribute to
overall delay in the propagation of signals within a memory device. Such
delays are always present between the point at which the clock signal is
received and the point at which the clock signal is applied, and result in
a phase shift between certain portions the memory device and the actual
system clock. As semiconductor chips get larger with increased
integration, and operating frequencies get higher, the percentage of the
clock cycle lost to delays increases. Such a phase shift, if severe
enough, results in timing errors as portions of the memory device are no
longer synchronous with the rest of the system.
A prior art example of an internal clock timing arrangement is set forth in
FIG. 1a. The example illustrates, in block diagram form, the timing scheme
for synchronous burst SRAM 10. The SRAM 10 includes a clock routing
circuit 12, an address latch 14, an X-Y decoding section 16, a burst
counter 18, an I/O latch 20, an I/O buffer 22, and an output pad 24. The
clock routing circuit 12 receives an external clock (CLK.sub.EX) and
drives the address latch 14, the burst counter 18, and the I/O latch 20.
For the example in FIG. 1a, a delay element (.delta.) is shown between the
external clock input and the I/O latch 20 that is representative of the
propagation delay or skew therebetween. The delay results in a delayed
clock signal (CLK.sub.I/O) being received at the input of the I/O latch
20.
Set forth in FIGS. 1b and 1c are two timing diagrams illustrating the
timing signals of CLK.sub.EX, CLK.sub.I/O, and the output data (shown as
Dout) for the SRAM 10 of FIG. 1a. The SRAM 10 of FIG. 1a is synchronous
with the rising clock edge of CLK.sub.EX and the total delay introduced
into CLK.sub.I/O is shown as "td". If the size of the delay td is too
great, timing errors can occur. For example, it is known in the prior art
to latch a memory address on a first clock edge and then internally latch
the corresponding output data on the following internal clock edge. After
a time tz, the output data will be available for sampling at the output on
a third clock edge. In order for proper sampling of the output data to
occur the output data must have a sufficient setup time prior to the third
external clock edge, "ts", and a sufficient hold time, "th", after the
third external clock edge. FIG. 1b illustrates an example where td is
small enough that the output data have a sufficient setup time. In
contrast, FIG. 1c illustrates an example where td is large enough that the
output data have an insufficient setup time.
While FIGS. 1a-1c set forth an example of a delay between the external
clock and the latching clock signal at the input to the I/O latch, it is
understood that similar delays can occur for other clocked portions of the
RAM. For example, an unwanted delay between the external clock input and
the input to the address latch would shift the actual address latching
point. As a result, the address may not have an adequate hold time to be
valid.
Clock skew is commonly reduced by efficient layout designs that minimize
long-distance interconnects carrying clock signals. U.S. Pat. No.
5,367,490 issued to Akimoto et al. on Nov. 22, 1994 illustrates an SRAM
design with an advantageous layout, as well as an apparatus for providing
a read pulse with a variable delay and duration to maximize timing
efficiency.
Other specialized synchronous memory architectures, such as "pipeline"
designs provide increased performance, but are still limited by the
propagation delay of the clock circuits therein.
Other memory designs, unrelated to synchronous RAMs, have addressed
variable data signals with phase locked loop designs. For example, a phase
locked input port for the reception of asynchronous serial data in a
two-port RAM is set forth in U.S. Pat. No. 5,260,909 issued to David
Davidian on Nov. 9, 1993.
While a number of approaches to improving clock performance in
semiconductor devices exist in the prior art, these approaches all have
limitations as to the amount of improvement they can provide. Thus it is
desirable to provide a circuit to improve clock performance beyond the
methods described above.
SUMMARY OF THE INVENTION
It is an object of the present invention to provide a clock circuit that
compensates for delays between an external clock signal and clock
controlled circuits within a synchronous memory device.
According to the present invention a synchronous monolithic memory device
includes a clock circuit for distributing an internal clock signal to at
least one clock controlled circuit within the memory device. The clock
circuit includes a phase correction subcircuit and a clock routing
subcircuit. The phase correction circuit includes a modified phase locked
loop (PLL) with a delay feedback element. The modified PLL provides an
internal clock signal that is derived from an external clock input and a
feedback input. The internal clock signal is phase shifted to compensate
for the delays introduced by the clock routing subcircuit.
According to one aspect of the invention the feedback signal is the
internal clock signal tapped from a midway point on the clock routing
subcircuit.
According to another aspect of the invention the feedback signal is a
feedback loop from the internal clock output to the feedback input. The
feedback loop includes a delay element analogous to the clock routing
subcircuit between the output of the modified PLL and a selected clock
controlled circuit within the memory device.
Other objects and advantages of the invention will become apparent in light
of the following description thereof.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1a is a block diagram illustrating the timing circuit of a prior art
synchronous burst SRAM.
FIG. 1b is a timing diagram illustrating the signals of the prior art SRAM
wherein no timing errors occur.
FIG. 1c is a timing diagram illustrating the signals of the prior art SRAM
wherein the delay introduced by the clock routing circuit results in a
timing error.
FIG. 2 is a block diagram illustrating an embodiment of the present
invention.
FIG. 3a is a timing diagram illustrating the clock signals of an embodiment
of the present invention before phase correction.
FIG. 3b is a timing diagram illustrating the clock signals of an embodiment
6f the present invention after phase correction.
FIG. 4 is top plan view illustrating the layout of an embodiment of the
present invention.
FIG. 5 is a top plan view illustrating an alternate embodiment of the
present invention.
DETAILED DESCRIPTION OF THE EMBODIMENTS
FIG. 2 illustrates, generally, a synchronous burst SRAM 110 according to
the present invention. The synchronous burst SRAM 110 includes a clock
circuit 112 that is coupled to an address latch circuit 114, a burst
counter circuit 116, an I/O latch circuit 118, and a clock buffer circuit
120. In the preferred embodiment, the address latch circuit 114 is
connected to a control circuit 122 which controls an X-decoder 124 and
Y-decoder 126. The Y-decoder 126 provides an output to the I/O latch
circuit 118. The I/O latch circuit 118 is connected to an I/O buffer 128
which provides a signal to an output pad 130.
The synchronous SRAM 110 receives a number of externally provided signals.
As shown in FIG. 2, the address latch circuit 114 receives an address
(shown as ADD), the I/O buffer 128 receives an output enable signal (shown
as OE), and the clock buffer circuit 120 receives a system clock signal
(shown as CLK.sub.SYS).
The clock circuit 112 includes a clock routing subcircuit 132 and a phase
correction subcircuit 134. As illustrated in FIG. 2, the two subcircuits
(132 and 134) overlap at a delay subcircuit 136. The phase correction
subcircuit 134 is essentially a phase locked loop (PLL) with the delay
subcircuit 136 functioning as a feedback element. The phase correction
subcircuit 134 of the embodiment is shown to include a phase comparator
138 which receives two inputs. The first input is the CLK.sub.EX signal
from the clock buffer circuit 120. The second input is a feedback input
that is a delayed clock signal (CLK.sub.DEL). The phase comparator 138
output is connected to a charge pump 140 which is coupled to a loop filter
142. The loop filter 142 provides an output to a voltage controlled
oscillator (VCO) 144. The VCO 144 outputs an internal clock signal (shown
as CLK.sub.INT). The CLK.sub.INT signal is used to clock the address latch
circuit 114, burst counter 116 and the I/O latch circuit 118. CLK.sub.INT
appears at the input to the I/O latch circuit 118 as CLK.sub.I/O.
In the embodiment of FIG. 2, the phase correction subcircuit 134 includes a
feedback path from the VCO 144 output to the input of the phase comparator
138. Within the feedback path is a first delay element 146a and a second
delay element 146b. A third delay element 146c is disposed between the
first delay element 146a and the input to the I/O latch circuit 118. The
delay elements 146a and 146c are representative of the delay and skew
inherent in the physical layout of the clock circuit 112 on the integrated
circuit die. The first delay element 146a is the delay and/or skew caused
by the propagation of the CLK.sub.INT signal from the output of the VCO
144 to a feedback node N1. The second delay element 146b is representative
of the delay and/or skew resulting from feeding the CLK.sub.INT signal
from node N1 back to the phase comparator 138. The third delay element
146c is the delay and/or skew caused by the propagation of CLK.sub.INT
from node N1 to the I/O latch circuit 118. Delay element 146b is
fabricated to be an approximate equivalent of 146c, i.e. the delay and/or
skew caused by 146b is equivalent to the delay/skew of 146c. In this
manner the clock signal received at the input to the phase comparator 138
has the same delay and skew as that received at the I/O latch circuit 118.
In operation, CLK.sub.SYS is received by clock buffer circuit 120 which
outputs the CLK.sub.EX signal. CLK.sub.EX and CLK.sub.DEL are received by
the phase comparator 138. In the preferred embodiment, the phase
comparator 138 is a sequential phase comparator of CMOS design. As is well
known in the art, the phase comparator 138 provides a pulse output with an
average differential voltage (vd) that is negative if CLK.sub.DEL lags
CLK.sub.EX, or positive is if CLK.sub.DEL leads CLK.sub.EX. The pulse
output is applied to the charge pump 140. The charge pump 140 is coupled
to the loop filter 142 which provides an error voltage (ve) output. The
error voltage is received by the VCO 144 which will increase or decrease
its output signal (CLK.sub.INT) according to ve. Initially, the delay/skew
introduced by the first and second delay elements (146a and 146b) will
result in CLK.sub.DEL lagging CLK.sub.EX. The phase correction subcircuit
134 compensates for the delay/skew by increasing the frequency of the
CLK.sub.INT signal at the VCO 144 output so that CLK.sub.INT leads the
CLK.sub.EX signal going into the delay elements (146a and 146b). When
CLK.sub.DEL is in phase with the CLK.sub.EX at the input to the phase
comparator 138 the phase correction subcircuit 134 functions as a PLL in
lock. Because the second and third delay elements (146b and 146c) are
designed to have the same phase delay/skew effect on a clock signal, the
CLK.sub.INT as received by the I/O latch circuit 118 (CLK.sub.I/O) is also
in phase with CLK.sub.EX.
FIGS. 3a and 3b are timing diagrams illustrating signals CLK.sub.EX,
CLK.sub.INT, CLK.sub.DEL, as well as CLK.sub.I/O. FIG. 3a illustrates the
initial state of the clock signals while FIG. 3b illustrates the clock
signals as corrected by the phase correction subcircuit 134. As set forth
in FIG. 3a, initially, CLK.sub.INT is synchronous with CLK.sub.EX, while
CLK.sub.DEL and CLK.sub.I/O lag CLK.sub.EX by an amount td. As represented
by FIG. 3b the phase correction subcircuit 134 functions as a modified PLL
that, when in "lock", phase shifts CLK.sub.INT forward by tl which is
approximately equal to td. Accordingly, CLK.sub.DEL and CLK.sub.I/O are
phase shifted forward so as to be synchronous with CLK.sub.EX.
FIG. 4 is a top plan view illustrating the layout of one embodiment of the
present invention. It is understood that the proportions are exaggerated
for explanatory purposes. As shown in the figure the synchronous SRAM 110
is fabricated on a semiconductor "chip" that includes a first side 148 and
an opposing second side 150. The clock buffer circuit 120 and phase
correction subcircuit 134 are situated on the first side 148, while the
clock routing subcircuit extends from the phase correction subcircuit 134,
across the chip, to the second side 150. The clock routing subcircuit 132
is generally symmetrical and provides the internal clock signal to four
I/O latch circuits 118 as well as address latch circuits 114.
Delay element 146a is shown to be the long-distance interconnect between
the phase correction subcircuit 134 and node N1. Delay element 146b is
shown to be a feedback interconnect from N1 back to the phase correction
subcircuit 134. Delay element 146c is the section of long-distance
interconnect stretching from node N1 to where the clock routing subcircuit
132 terminates at the address latch circuit 114. As mentioned previously,
delay element 146b is designed to have the same approximate clock signal
response as that of delay element 146c. In the embodiment illustrated in
FIG. 5, this is accomplished by delay element the 146b being composed of
the same fabrication elements as delay element 146c. Both delay elements
(146b and 146c) are fabricated from the same metallization layer, have the
same line width, and are routed the same manner, i.e. over the same
underlying material (in the preferred embodiment an interlayer dielectric
and field oxide). Parasitic capacitance between the long-distance
interconnects is minimized by providing sufficient layout distance between
the two lines. However, one skilled in the art would recognize that the
interconnects could be closer in proximity with the resulting parasitic
capacitance compensated for by moving the position of node N1 accordingly.
Referring once again to FIG. 4 it is shown that a number of I/O latch
circuits 118 and address latch circuits 114 are situated along the clock
routing subcircuit 132. As a result, the delay induced by the clock
routing subcircuit 132 grows in proportion to the distance between the
latch circuit (118 or 114) and the clock input. For example, the phase lag
experienced by the I/O latch circuits 118 closest to the phase correction
subcircuit 134, are much smaller than that experienced by the address
latch circuits 114. If these differences are severe enough, it is possible
to overcompensate for clock delay with phase correction subcircuit 134.
This can occur when CLK.sub.INT leads CLK.sub.EX by such an amount that
clocked circuits closest to the phase correction subcircuit 134 are
clocked too early, resulting in timing errors. For example, the I/O latch
circuits 118a closest to the phase correction circuit 134 could latch
before the correct data are provided by decoder operation. Accordingly,
proper timing margins for clocked circuits should be observed in
establishing the value of the delay elements 146.
An alternate embodiment of the present invention is set forth in FIG. 5
which is a partial top plan view of a synchronous SRAM 110. The alternate
embodiment has a similar configuration to that of the embodiment set forth
in FIG. 4, and to that extent, identical items to those appearing in the
FIG. 4 will be referred to by the same general reference number.
Referring once again to FIG. 2, the embodiment of FIG. 5 illustrates the
alternate embodiment wherein the first delay element 146a is eliminated
(its effect on the clock signal is negligible) by moving node N1 to the
output of the phase correction subcircuit 134. In this configuration the
second delay element 146b functions as a single feedback element. As shown
in FIG. 5, in the alternate embodiment, the feedback element 146b includes
a "snake" pattern of interconnect having the equivalent resistance and
capacitance as delay element 146c. Unlike the example of FIG. 4, delay
element 146c is now the long-distance interconnect extending from the
output of the phase correction subcircuit 134 to the opposing second side
150 of the synchronous SRAM 110.
While the embodiments of FIGS. 4 and 5 both illustrate synchronous SRAMs
110 in which the same modified clock (CLK.sub.INT) is distributed to a
number of clocked circuits, it is understood that the phase correction
subcircuit 134 could be used to apply a modified (phase shifted forward)
clock signal to only those clocked circuits experiencing the greatest
delay/skew (in FIG. 2, the latching circuits toward the second side 150 of
the chip). A conventional clock routing circuit using the CLK.sub.EX
signal could be used to provide a clock to the remaining clocked circuits.
Alternatively, for clocked circuits that receive a CLK.sub.INT signal that
is phase shifted too far forward, delay elements could be introduced to
correct the timing.
As will be apparent to one skilled in the art, the invention has been
described in connection with its preferred embodiments, and may be
changed, and other embodiments derived, without departing from the spirit
and scope of the invention. Accordingly, the scope of the claimed
invention is better defined with reference to the following claims.
* * * * *
|
|
|
|
|
Description  |
|