WikiPatents - Community Patent Review
Create Free Account  |  License or Sell Your Patent  |  WikiPatents Marketplace  |  WikiPatents Blog
Username:  Password:  
    
Advanced Search
System and method for using cached data at a local node after re-opening a file at a remote node in a distributed networking environment    
United States Patent4897781   
Link to this pagehttp://www.wikipatents.com/4897781.html
Inventor(s)Chang; Albert (Austin, TX); Neuman; Grover H. (Austin, TX); Shaheen-Gouda; Amal A. (Austin, TX); Smith; Todd A. (Austin, TX)
AbstractIn a distributed environment several data processing systems are interconnected across a network system. A distributed services program installed on the systems in the network allows the processors to access data files distributed across the various nodes of the network without regard to the location of the data file in the network. The processing system accessing file, referred to as the client processing system, utilizes a client cache within its operating system to store the data file. Utilizing the client cache minimizes the number of reads and writes that must go over the network to the server processing system where the file physically resides. The system and method of this invention prevents a process in the client processing system from accessing data in the client cache that has been modified at another node in the network. The blocks of data in the client cache are tested for validity in the client processing system by using modification times as measured by the server processing system. If the cache data blocks are determined to be valid, the data blocks are determined to be invalid, the data blocks are discarded, and the file is accessed from the server processing system.



 Title Information Submit all comments and votes
 
Patent Text Patent PDF Print Page Summary File History
Plain text PDF images Print Summary File History
Inventor     Chang; Albert (Austin, TX); Neuman; Grover H. (Austin, TX); Shaheen-Gouda; Amal A. (Austin, TX); Smith; Todd A. (Austin, TX)
Owner/Assignee     International Business Machines Corporation (Armonk, NY)
Patent assignment
All assignments
Publication Date     January 30, 1990
Application Number     07/014,899
PAIR File History     Application Data   Transaction History
Image File Wrapper   Patent Term   Fees
Litigation
Filing Date     February 13, 1987
US Classification     707/201
Int'l Classification     G06F 015/16 G06F 011/00
Examiner     Lee; Thomas C.
Assistant Examiner    
Attorney/Law Firm     Smith; Marilyn D.
Address
Parent Case    
Priority Data    
USPTO Field of Search     364/200 MS File 364/900 MS File 364/300
Patent Tags     cached data local node after re-opening file remote node distributed networking environment
   
Enter a comma (,) or semicolon (;) between multiple tag words/phrases.
Describe this patent:
 Amusing   
 Clever   
 Complex   
 Efficient   
 Historic   
 Important   
 Innovative   
 Interesting   
 Practical   
 Simple   
[no votes]
Patent WIKI

Share information and news about this patent, including information and news about the technology, inventors, company, ligation and licensing.

 References Submit all comments and votes
 
*references marked with an asterisk below are user-added references
 U.S. References
 
Add a new US reference:  
ReferenceRelevancyCommentsReferenceRelevancyComments
4718005
Feigenbaum
709/222
Jan,1988

[0 after 0 votes]
4718002
Carr
709/201
Jan,1988

[0 after 0 votes]
4714996
Gladney
707/203
Dec,1987

[0 after 0 votes]
4714995
Materna
707/201
Dec,1987

[0 after 0 votes]
4714992
Gladney
707/206
Dec,1987

[0 after 0 votes]
4710870
Blackwell
714/6
Dec,1987

[0 after 0 votes]
4686620
Ng
707/10
Aug,1987

[0 after 0 votes]
4635189
Kendall
707/10
Jan,1987

[0 after 0 votes]
4620276
Daniell
709/201
Oct,1986

[0 after 0 votes]
4432057
Daniell
707/8
Feb,1984

[0 after 0 votes]
 Foreign References
 Other References
 Market Review Submit all comments and votes
   
Market Size
Estimate the gross annual revenues of the relevant market sector:
> $10B
$5B - $10B
$2B - $5B
$500M - $2B
$100M - $500M
$10M - $100M
$1M - $10M
$500K - $1M
$100K - $500K
< $100K
[No votes]
$0
 
$0   $2.5B   $5B   $7.5B   $10B
Market Share
Estimate the percentage of the relevant market sector this invention will capture:
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Reasonable Royalty
What percentage of gross sales should the inventor or assignee be paid?
75% - 100%
50% - 74.99%
25% - 49.99%
10 - 24.99%
5 - 9.99%
2 - 4.99%
1 - 1.99%
< 1%
[No votes]
0.0%
 
0%   25%   50%   75%   100%
Public's "Guesstimation" of Royalty Value
Market SizeN/A[No votes]
xMarket ShareN/A[No votes]
xReasonable RoyaltyN/A[No votes]

N/A

License Availablity
If you are NOT the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
License Availablity
If you ARE the owner or assignee, answer here:
Yes, license is available for purchase

No, license is not currently available



[No votes]
Competitive Advantage
Does this invention have a significant competitive advantage over similar technologies?
Yes

No



[No votes]
Most helpful competitive advantage comment
[No comments]

Commercial Alternatives
Are there viable commercial alternatives for this invention?
Yes

No



[No votes]
Most helpful commercial alternative comment
[No comments]

 Technical Review Submit all comments and votes
 Claims Submit all comments and votes
 


We claim:

1. A system for determining the validity of data residing in a plurality of blocks in a client cache, of a client data processing system at a client mode, from a file residing in a server data processing system at a server node, wherein said server data processing system and said client data processing system are connected by means of a communications link, said system comprising:

means for determining, at the server node, a latest modification time of the file;

means, in the client data processing system, for saving, corresponding to each of the blocks in the client cache, the latest determined modification time of the file, received from the server data processing system, when the file is closed at the client;

means for recording, in the client data processing system, another determined latest modification time, received from the server data processing system, when said file is subsequently reopened at said client processing system; and

means coupled to said saving means and said recording means for comparing, in the client data processing system, the latest determined modification time corresponding to one of said blocks with the recorded another determined latest modification time to determine the validity of said block.

2. The system of claim 1 wherein said one of the blocks of the file in the client cache is accessed if said another determined latest modification time in the client processing system is equal to the latest determined modification time corresponding to each of said blocks.

3. The system of claim 1 wherein the file is accessed from the server processing system if the another determined latest modification time is not equal to the latest determined modification time corresponding to each of said blocks.

4. The system of claim 1 wherein the blocks of the file in the client cache are discarded whenever the blocks are determined to be invalid.

5. A system for using cached data residing in a plurality of blocks in a client cache, of a client data processing system at a client node, from a file residing in a server data processing system at a server node, wherein said server data processing system and said client data processing system are connected by means of a communications link, said system comprising:

first means for recording, at the server data processing system, a one last modification time for the file whenever the file at the server data processing system is modified;

means for saving, in said client data processing system, said one last modification time, received from the server data processing system, for each of the cached data blocks for the file in the client cache when the file is closed at the client data processing system;

second means for recording, in the client data processing system, another last modification time, received from the server data processing system, of the file at the server processing system, at a time of a subsequent reopen of the file in the client data processing system; and

means coupled to said saving means and said second recording means for using at least one of the blocks of the file in the client cache if said using means determines that said saved one last modification time for each of said blocks is equal to the recorded another last modification time in the client data processing system.

6. A method for using cached data residing in a plurality of blocks in a client cache, of a client processing system at a client mode, from a file residing in a server processing at a server node, wherein said server processing system and said client processing system are connected by means of a communications link, said method comprising the steps of:

recording by the client processing system a one last modification time, received from the server data processing system, of the file at the server processing system, for each of the cached data blocks in the client cache when the file is closed at the client processing system;

recording by the client processing system another last modification time, received from the server processing system, of the file at the server processing system during a subsequent reopen of the file in the client processing system; and

using at least one of the cached data blocks by a processing the client processing system if said process determines that said one last modification time for each of the at least one cached data blocks is equal to the another last modification time, of the file, recorded in the client processing system.

7. The method of claim 6 further comprising the step of accessing the file from the server processing system if the one last modification time and the another last modification time are not equal.

8. The method of claim 6 further comprising the step of discarding data in the cached data blocks when the last one modification time and the another last modification time are not equal.

9. A method for determining the validity of data residing in a cache block in a client processing system from a file residing in a server processing system, wherein the client processing system and the server processing system are connected by means of a communication link, said method comprising the steps of:

recording by the server processing system a latest modification time for the file at the server processing system whenever the file at the server processing system is updated;

recording by the client processing system the latest modification time, received from the server processing system, when the file in the client processing system is first opened;

allocating the cache block assigned to the file in the client cache in the client processing system when a block of the file is read from the server processing system;

recording by said client processing system the latest modification time recorded in the client processing system for each allocated cache block;

updating by said client processing system the latest modification time recorded in each said allocated cache block assigned to the file when the file is last closed at the client processing system; and

comparing by said client processing system the latest modification time recorded in the client processing system a first reopen to the updated latest modification time for each said allocated cache block, to determine the validity of each said allocated cache block.

10. The method of claim 9 further comprising the step of using, by a process in the client processing system, the cached block if the latest modification time recorded in the client processing system is the same as the latest modification time for the cache block.

11. The method of claim 9 further comprising the step of accessing the file from the server processing system, by the client processing system, if the latest modification time recorded in the client processing system is different from the latest modification time for the allocated cache block.

12. The method of claim 9 further comprising the step of discarding the data in the client cache block if the latest modification time recorded in the client processing system at the first reopen is not equal to the time for the cache block.

13. The method of claim 9 further comprising the step of replacing the data in the cache block with current data from the server if the latest modification time recorded in the client processing system at the first reopen is not equal to the latest modification time for the cache block.

14. The method of claim 9 wherein the data is discarded at any time the client processing determines the said compared modification times are not equal.

15. A system for keeping a plurality of blocks of data in a client cache, from a file residing at a server processing system, available for use by a client processing system when the file is reopened after a close of the file at the client processing system, wherein the client processing system and the server processing system are connected by means of a communication link, said system comprising:

first means for recording, in each of the cached blocks in the client processing system, a one last modification time, received from the server processing system, of the file at the server processing system, when the file is closed at the client processing system;

second means for recording, in the client processing system, another last modification time, received from the server processing system, of the file at the server processing system, when the file is subsequently reopened at the client processing system;

means coupled to said first and second recording means for comparing the modification times in the client processing system and the cached blocks to determine if the data in the cache blocks remained unmodified at the server processing system while the file at the client processing system was closed; and

means coupled to said comparing means for using the data from the cached blocks by a process in the client processing after the file is subsequently reopened in the client processing system if the compared modification times are equal.

16. A computer program having program code means for determining the validity of data residing in a plurality of blocks in a client cache, of a client data processing system at a client node, from a file residing in a server data processing system at a server node, wherein said server data processing system and said client data processing system are connected by means of a communications link, said computer program comprising:

first program code means, in the client data processing system, for causing a recording of correspondence between each of the blocks in the client cache and a latest determined modification time of the file, received from the server data processing system, when the file is closed at the client;

second program code means for causing a recording, in the client data processing system, of another determined latest modification time, received from the server data processing system, when said file is subsequently reopened at said client processing system; and

third program code means for causing a comparison, in the client data processing system, between the latest determined modification time corresponding to one of said blocks and the recorded another determined latest modification time to determine the validity of said block.
 Description Submit all comments and votes
 


CROSS REFERENCE TO RELATED APPLICATIONS

This application is related in subject matter to the following applications filed concurrently herewith and assigned to a common assignee:

ApplicatIon Ser. No. 07/014,884, currently copending and filed by D. W. Johnson, L. W. Henson, A. A. Shaheen-Gouda, and T. A. Smith for Negotiating Communication Conventions Between Nodes In A Network, now abandoned.

Ser. No. 07/014,897, currently copending and filed by D. W. Johnson, G. H. Neuman, C. H. Sauer, A. A. Shaheen-Gouda, and T. A. Smith for A System And Method For Accessing Remote Files In A Distributed Networking Environment.

Application Ser. No. 07/014, currently copending and filed by D. W. Johnson, A. A. Shaheen-Gouda, T. A. Smith for Distributed File Access Structure Lock.

Application Ser. No. 07/014,891, curently copending and filed by L. W. Henson, A. A. Shaheen-Gouda, and T. A. Smith for File and Record Locking Between Nodes in A Distributed Data Processing Environment.

Application Ser. No. 07/014,892, currently copending and filed by D. W. Johnson, L. K. Loucks, C. H. Sauer, and T. A. Smith for Single System Image Uniquely Defining An Environment For Use In A Data Processing System.

Application Ser. No. 07/014,888, currently copending and filed by D. W. Johnson, L. K. Loucks, A. A. Shaheen-Gouda for Interprocess Communication Queue Location Transparency.

Application Ser. No. 07/014,889, currently copending and filed by D. W. Johnson, A. A. Shaheen-Gouda, and T. A. Smith for Directory Cache Management In a Distributed Data Processing System.

The disclosures of the foregoing co-pending applications are incorporated herein by reference.

DESCRIPTION

1. Field of the Invention

This invention relates to processing systems connected through a network, and more particularly to the accessing of files between local and remote processing systems in a distributed networking environment.

2. Background Art

As shown in FIG. 1, a distributed networking environment 1 consists of two or more nodes A, B, C, connected through a communication link or a network 3. The network 3 can be either a local area network (LAN), or a wide area network (WAN). The latter consists of switched or leased teleprocessing (TP) connections to other nodes, or to a systems network architecture (SNA) network of systems.

At any of the nodes A, B, C, there may be a processing system 10A, 10B, 10C, such as a personal computer. Each of these processing systems 10A, 10B, 10C, may be a single user system or a multi-user system with the ability to use the network 3 to access files located at a remote node. For example, the processing system 10A at local node A, is able to access the files 5B, 5C at the remote nodes B, C.

The problems encountered in accessing a file at a remote nodes can be better understood by first examining how a stand-alone system accesses files. In a standalone system, such as 10 as shown in FIG. 2, a local buffer 12 in the operating system 11 is used to buffer the data transferred between the permanent storage 2, such as a hard file or a disk in a personal computer, and the user address space 14. The local buffer 12 in the operating system 11 is also referred to as a local cache or kernel buffer.

In the standalone system, the kernel buffer 12 is identified by blocks 15 which are designated as device number, and logical block number within the device. When a read system call 16 is issued, it is issued with a file descriptor of the file 5, and a byte range within the file 5, as shown in step 101, FIG. 3. The operating system 11 takes this information and converts it to device number, and logical block numbers in the device, step 102, FIG. 3. Then the operating system 11 reads the cache 12 according to the device number and logical block numbers, step 103.

Any data read from the disk 2 is kept in the cache block 15 until the cache block 15 is needed. Consequently, any successive read requests from an application 4 that is running on the processing system 10 for the same data previously read is accessed from the cache 12 and not the disk 2. Reading from the cache is less time consuming than going out to the fixed disk 2, accessing the correct disk sectors, and reading from the disk.

Similarly, data written from the application 4 is not saved immediately on the disk 2, but is written to the cache 12. This saves disk accesses if another write operation is issued to the same block. Modified data blocks in the cache 12 are saved on the disk 2 periodically.

Another use of the local cache in a stand-alone system is to hold valid data for a file even after the file is closed. If the file is re-opened while these blocks still exist in the cache, then no disk access is required for reading the blocks.

Use of a cache in a standalone system that utilizes an AIX.sup.1 (Advanced Interactive Executive) operating system improves the overall performance of the system since disk accessing is eliminated for successive reads and writes. Overall performance is enhanced because accessing permanent storage is slower and more expensive than accessing a cache.

.sup.1 AIX is a trademark of IBM Corporation.

In a distributed environment, as shown in FIG. 1, there are two ways the processing system 10C in local node C could read the file 5A from node A. In one way, the processing system 10C could copy the whole file 5A, and then read it as if it were a local file 5C residing at node C. Reading a file in this way creates a problem if another processing system 10B, 10A at another node A, B modifies the file 5A after the file 5A has been copied at node C. The processing system 10C would not have access to these latest modifications to the file 5A.

Another way for processing system 10C to access a file 5A at node A is to read one block N1 at a time as the processing system at node C requires it. A problem with this method is that every read has to go across the network communication link 3 to the node A where the file resides. Sending the data for every successive read is time consuming.

Accessing files across a network presents two competing problems as illustrated above. One problem involves the time required to transmit data across the network for successive reads and writes. On the other hand, if the file data is stored in the node to reduce network traffic, the file integrity may be lost. For example, if one of the several nodes is also writing to the file, the other nodes accessing the file may not be accessing the latest updated file that has just been written. As such, the file integrity is lost since a node may be accessing incorrect and outdated files.

Within this document, the term "server" will be used to indicate the node where the file is permanently stored, and the term "client" will be used to mean any other node having processes accessing the file. It is to be understood, however, that the term "server" does not mean a dedicated server as that term is used in some local area network systems. The distributed services system in which the invention is implemented is a truly distributed system supporting a wide variety of applications running at different nodes in the system which may access files located anywhere in the system.

The invention to be described hereinafter was implemented in a version of the UNIX.sup.2 operating system but may be used in other operating systems having characteristics similar to the UNIX operating system. The UNIX operating system was developed by Bell Telephone Laboratories, Inc., for use on a Digital Equipment Corporation (DEC) minicomputer but has become a popular operating system for a wide range of minicomputers and, more recently, microcomputers. One reason for this popularity is that the UNIX operating system is written in the C programming language, also developed at Bell Telephone Laboratories, rather than in assembly language so that it is not processor specific. Thus, compilers written for various machines to give them C capability make it possible to transport the UNIX operating system from one machine to another. Therefore, application programs written for the UNIX operating system environment are also portable from one machine to another. For more information on the UNIX operating system, the reader is referred to UNIX.TM. System, User's Manual, System V, published by Western Electric Co., January 1983. A good overview of the UNIX operating system is provided by Brian W. Kernighan and Rob Pike in their book entitled The Unix Programming Environment, published by Prentice-Hall (1984). A more detailed description of the design of the UNIX operating system is to be found in a book by Maurice J. Bach, Design of the Unix Operating System, published by Prentice-Hall (1986).

.sup.2 Developed and licensed by AT&T. UNIX is a registered trademark of AT&T in the U.S.A. and other countries.

AT&T Bell Labs has licensed a number of parties to use the UNIX operating system, and there are now several versions available. The most current version from AT&T is version 5.2. Another version known as the Berkeley version of the UNIX operating system was developed by the University of California at Berkeley. Microsoft, the publisher of the popular MS-DOS and PC-DOS operating systems for personal computers, has a version known under their trademark as XENIX. With the announcement of the IBM RT PC.sup.3 (RISC (reduced instruction set computer) Technology Personal Computer)) in 1985, IBM Corp. released a new operating system called AIX which is compatible at the application interface level with AT&T's UNIX operating system, version 5.2, and includes extensions to the UNIX operating system, version 5.2. For more description of the AIX operating system, the reader is referred to AIX Operating System Technical Reference, published by IBM Corp., First Edition (Nov. 1985).

.sup.3 RT and RT PC are trademarks of IBM Corporation.

The invention is specifically concerned with distributed data processing systems characterized by a plurality of processors interconnected in a network. As actually implemented, the invention runs on a plurality of IBM RT PCs interconnected by IBM's Systems Network Architecture (SNA), and more specifically SNA LU 6.2 Advanced Program to Program Communication (APPC). An Introduction To Advanced Program-To-Program Communication (APPC), Technical Bulletin by IBM International Systems Centers, July 1983, no. GG24-1584-0, and IBM RT PC SNA Access Method Guide and Reference, Aug. 15, 1986, are two documents that further describe SNA LU 6.2.

SNA uses as its link level Ethernet.sup.4 a local area network (LAN) developed by Xerox Corp., or SDLC (Synchronous Data Link Control). A simplified description of local area networks including the Ethernet local area network may be found in a book by Larry E. Jordan and Bruce Churchill entitled Communications and Networking for the IBM PC, published by Robert J. Brady (a Prentice-Hall company) (1983). A more definitive description of communications systems for computers, particularly of SNA and SDLC, is to be found in a book by R. J. Cypser entitled Communications Architecture for Distributed Systems, published by Addison-Wesley (1978). It will, however, be understood that the invention may be implemented using other and different computers than the IBM RT PC interconnected by other networks than the Ethernet local area network or IBM's SNA.

.sup.4 Ethernet is a trademark of Xerox Corporation.

As mentioned, the invention to be described hereinafter is directed to a distributed data processing system in a communication network. In this environment, each processor at a node in the network potentially may access all the files in the network no matter at which nodes the files may reside.

Other approaches to supporting a distributed data processing system in a UNIX operating system environment are known. For example, Sun Microsystems has released a Network File System (NFS) and Bell Laboratories has developed a Remote File System (RFS). The Sun Microsystems NFS has been described in a series of publications including S.R. Kleiman, "Vnodes: An Architecture for Multiple File System Types in Sun UNIX", Conference Proceedings, USENIX 1986 Summer Technical Conference and Exhibition, pp. 238 to 247; the Sun Network Filesystem", Conference Proceedings, Usenix 1985, pp. 119 to 130; Dan Walsh et al., "Overview of the Sun Network File System", pp. 117 to 124; JoMei Chang, "Status Monitor Provides Network Locking Service for NFS"; JoMei Chang, "SunNet", pp. 71 to 75; and Bradley Taylor, "Secure Networking in the Sun Environment", pp. 28 to 36. The AT&T RFS has also been described in a series of publications including Andrew P. Rifkin et al., "RFS Architectural Overview", USENIX Conference Proceedings, Atlanta, Georgia (June 1986), pp. 1 to 12; Richard Hamilton et al., "An Administrator's View of Remote File Sharing", pp. 1 to 9; Tom Houghton et al., "File Systems Switch", pp. 1 to 2; and David J. Olander et al., "A Framework for Networking in System V", pp. 1 to 8.

One feature of the distributed services system in which the subject invention is implemented which distinguishes it from the Sun Microsystems NFS, for example, is that Sun's approach was to design what is essentially a stateless machine. More specifically, the server in a distributed system may be designed to be stateless. This means that the server does not store any information about client nodes, including such information as which client nodes have a server file open, whether client processes have a file open in read.sub.-- only or read.sub.-- write modes, or whether a client has locks placed on byte ranges of the file. Such an implementation simplifies the design of the server because the server does not have to deal with error recovery situations which may arise when a client fails or goes off-line without properly informing the server that it is releasing its claim on server resources.

An entirely different approach was taken in the design of the distributed services system in which the present invention is implemented. More specifically, the distributed services system may be characterized as a "statefull implementation". A "statefull" server, such as that described here, does keep information about who is using its files and how the files are being used. This requires that the server have some way to detect the loss of contact with a client so that accumulated state information about that client can be discarded. The cache management strategies described here, however, cannot be implemented unless the server keeps such state information. The management of the cache is affected, as described below, by the number of client nodes which have issued requests to open a server file and the read/write modes of those opens.

SUMMARY OF THE INVENTION

It is therefore an object of this invention to improve the response time in accessing remote files.

It is a further object of this invention to maintain the file integrity in a distributed networking environment.

It is a further object to use a cache in both the server and client nodes to hold valid data when a file is closed in the client node.

To reduce the network traffic overhead when files at other nodes are accessed, and to preserve the file integrity, the accessing of the various files in a distributed networking environment are managed by file synchronization modes. A file is given a first synchronization mode if a file is open at only one node for either read or write access. A file is given a second synchronization mode if a file is opened for read only access at any node. A file is given a third synchronization mode if the file is open for read access in more than one node, and at least one node has the file open for write access.

If a file is in either the first or second synchronization mode, the client node, which is the node accessing the file, uses a cache within its operating system to store the file. All read and writes are then sent to this cache.

The system and method of this invention uses a cache in both the client and server nodes to hold valid data when a file is closed in the client node. Whether or not the client node reuses the client cache depends on whether or not the data in the client cache has been modified at another node during the time that the data file was closed at the client node. If the data has not been modified, the client cache can be accessed by reads and writes from processes in the client node without sacrificing file integrity. All data in the client cache is valid data. By using the client cache for access when a file has been opened after it had once been closed, network traffic overhead is reduced, and the read and write response time is decreased, thereby improving the response time.

To determine whether or not the data in the client cache has been modified at another node while the file was closed at the client node, the system of this invention comprises a surrogate inode in the client cache. The surrogate inode contains a field that identifies the server node, and also a file handle that identifies the file in that node. A surrogate inode is created in the client cache whenever the file is initially opened at a node, or is first opened after a last close. The last modification time of the file, as recorded by the server's clock, is written to the surrogate inode whenever a surrogate inode is created. The system of this invention also comprises a file modification time field in the cache data blocks that indicate the last modification time of the file at the server. The file modification time field in the cache data blocks are updated during the last close of the file at the client node.

The method of this invention comprises the following steps during the opening, reading, and closing of a file at a client node.

When an open for a file is issued in a client node, the surrogate inode table in the client processing system is scanned for an existing surrogate inode. If none exists, then a new surrogate inode is allocated and an open remote procedure call is sent to the server. When the open is complete at the server, the open acknowledgement from the server to the client will include the last modification time for the file. This time is recorded in the newly allocated surrogate inode for the file at the client node.

When new blocks of data of the file are read, a new cache block is allocated in the client cache. Each cache block contains the server node name, the file handle, and the last modification time from the surrogate inode.

When the file is opened for a second or subsequent time, and the surrogate inode table in the client processing system is scanned for an existing surrogate inode, a surrogate inode will already exist from a previous open. In this case, there is no change to the modification time, or to he surrogate inode. The last modification time on the data blocks are not changed with a second open, either.

During the last close of a file that is in ASYNCH mode, the following steps occur. First, the client sends a close to the server. Then, upon receiving the close request from the client, the server sends an acknowledgement of the close to the client. With the close acknowledgement, the server sends the last time that the file was modified to the client. The server may have to go to the disk at the server to get this last modification time. The client then deallocates the surrogate inode, and scans all of the remote cache buffers for blocks which have the server node name and file handle for the file being closed. The client then changes all of the last modification times in the corresponding cache blocks to the one received from the server with the close acknowledgement.

Whenever a block is being read from the client cache, the time in the surrogate inode is compared with the time in the cache data block. A time in the surrogate inode that is greater than the time in the cache data blocks indicates that the data in the client cache has been modified while the data file has been closed at the client node. In this case, the client node must go over the network to the server to get the last modified data. To maintain file integrity, all blocks of data for the file in the client cache must be invalidated.

A time in the surrogate inode that is the same time as recorded in the cache data blocks indicates that the data in the client cache is still valid. No other node has modified this data while the file was closed at the client node. In this case, processes within the client node can use the block of data in the client cache without going across the network to the server where the file actually resides.

BRIEF DESCRIPTION OF THE DRAWING

FIG. 1 shows three processing systems connected in a networking environment as known in the art.

FIG. 2 shows a stand-alone processing system using a kernel buffer as known in the art.

FIG. 3 shows a flow chart of a read to the kernel buffer in a stand-alone system as known in the art.

FIG. 4 shows three distributed processing systems connected in a network for accessing files across the network with client and server caches.

FIG. 5 shows a client and server node having client and server caches, respectively in READONLY or ASYNC synchronization mode.

FIG. 6 shows the three synchronization modes used for managing the use of client and server caches in a distributed networking environment.

FIG. 7 shows the transitions between the three synchronization modes.

FIG. 8 shows a client accessing a file at the server in FULLSYNC s.sub.-- mode.

FIG. 9 shows the steps during a read when a client cache is used, and when the client cache is not used.

FIG. 10 shows a distributed networking environment wherein the client cache has a surrogate inode and time bits in the cache blocks for determining the validity of the data in the cache data blocks.

FIG. 11 shows the steps of the present invention during the opening, reading, and closing of the file at the client node.

DESCRIPTION OF THE PREFERRED EMBODIMENT

In the present invention as shown in FIG. 4, a local cache 12A, 12B, 12C, exists at every node A,B,C. If file 5 permanently resides at node A on disk 2A, node A is referred to as the server. At the server A, use of the cache 12A by local processes 13A executing at the server node A is as that in a stand-alone system as discussed above in the Background Art.

However, remote processes 13B, 13C executing at nodes B, C, access the file 5 through a two step caching scheme using a server cache and a client cache as shown more clearly in FIG. 5. The server node A gets blocks of file 5 from disk 2A and stores it in the server cache 12A. Client node B goes out over the network 3 and gets blocks of file 5 from the server cache 12A. Client node B stores the blocks of the file 5 as it existed in the server cache 12A into the client cache 12B. When the user address space 14B of client node B seeks data from file 5, in ASYNCH or READONLY sync mode, the client cache 12B is accessed instead of going across the network 3 for each access. Using the client cache 12B to access a remote file 5 can significantly improve the performance since it can save network traffic and overhead.

The use of the client cache 12B and server cache 12A are managed in a distributed environment to achieve high performance while preserving the file access semantics at the application program level. This allows existing programs which run on a stand-alone system to run on a distributed system without any modification.

The file access semantics preserves a file's integrity as it is being opened by different processes that issue read and write system calls to access and modify the file. The file access semantics require that only one I/O operation is allowed on any byte range at any time, and once an I/O operation starts, it cannot be pre-empted by any other I/O operation to the same byte range of the file.

An example of this is given by referring again to FIG. 5. If process 131 issues a write system call to a byte range N1-N2 in file 5, the write system call can only be executed when the entire byte range N1-N2 is available for access by process 131, and no read operation involving the byte range N1-N2 is being executed. During the execution of the write system call, all other operations involving the byte range N1-N2 in file 5 are suspended until the write is completed. The write is not completed until the bytes are written to the local cache 12A. When a write request is complete, the written data in the cache 12A is visible to any subsequent read operation by any of the other processes 131-13N.

Another requirement of file access semantics is that when a file byte range such as N1-N2, which can be a record or a set of related records accessed by the same I/O operation, is visible to a reading process, the file byte range N1-N2 must always have a consistent set of data reflecting the last update to this range. This range is never available for access while a write operation is being executed. In this way the next read issued by a process will read the data just written and not the old outdated data.

In a distributed networking environment of this invention as shown in FIG. 5, the execution of read and write system calls from different application synchronized such that the file access semantics as discussed above are preserved. Synchronization is guaranteed by utilizing various cache synchronization (sync) modes. For a specific file 5, the I/O calls are synchronized by either the client B or the server A depending on the location of the processes 131-13N, 231-231N which have the file 5 open for access, and the sync mode.

The three synchronization modes are shown in FIG. <6, and are described with reference to FIG. 4. The first mode 41 is referred to as ASYNCH s.sub.-- mode, or asynchronous mode. The file 5 operates in this mode 41 if the file 5 is open for read/write access by processes 13C executing at only one client remote node C, as shown in block 44, FIG. 6. In this mode 41, all of the control is in the client node C. Both the server cache 12A and client cache 12C are used for these read/write operations. A read or write operation requires access to the server cache 12A only if it cannot be satisfied from the client cache 12C. Modified blocks at the client 12C are written to the server 12A by the periodic sync operation, or when the file 5 is closed by all processes 13C in the client node C, or when a block must be written in order to make room for other data being brought into the cache. Additionally, modified blocks are written to the

server when the file changes from ASYNCH s.sub.-- mode to FULLSYNC s.sub.-- mode.

A second mode 42 is READONLY s.sub.-- mode. The READONLY s.sub.-- mode 42 is used for files 5 that are open for read only access from processes 13C in only one node C, or from processes 13B, 13C in more than one node B, C, as shown in block 45, FIG. 6. In this mode 42, the server cache 12A and the client caches 12B and/or 12C are used. The read request is issued for a block or more at a time. Every other read request from the same client, either B or C, to the specific block does not go to the server 12. Instead, it is read from the respective client cache, either B or C. In other words, a read operation does not require access to the server 12A if it can be satisfied from the client cache 12C or 12B. In summary, the file 5 operates in mode 42 if the file 5 is open for read only access by any of the processes 13A, 13B, 13C, in any of the nodes A,B,C.

A third mode 43 is FULLSYNCH s.sub.-- mode. The FULLSYNC s.sub.-- mode 43 is used for files 5 open for write access by a process 13A in the server node A, as shown by block 48, FIG. 6. This sync mode 43 is also used if the file 5 is open in the server node A and at least one other node B, C, and at least one process 13A, 13B, or 13C has the file 5 open for write access, as shown by blocks 46,47, FIG. 6. In general, if more than one node has